58 Commits

Author SHA1 Message Date
hiyouga
a7b1632ace fix value head model resuming
Former-commit-id: 2a36fd5064f028f394ac07c25440fd5e965a07b8
2023-11-20 19:01:37 +08:00
hiyouga
682d81caa9 fix #1567
Former-commit-id: 99a3f06377d2886c4000ce7e3583b12ca965534d
2023-11-20 18:46:36 +08:00
hiyouga
0d98d1a28c fix quantization
Former-commit-id: ccb0f58e22f55b15531fd0e85f5935b150575bec
2023-11-17 22:21:29 +08:00
hiyouga
f9df6c17ed fix #1550
Former-commit-id: 1bbc1be95eedf0796c0b311568dff8c75f87dfbb
2023-11-17 17:23:13 +08:00
hiyouga
3f53155a90 fix bug in web ui
Former-commit-id: 6efa38be46ed536f80fc67002f23862edcb9df8d
2023-11-16 15:21:24 +08:00
hiyouga
678052a7ef fix rlhf callback
Former-commit-id: 1817ffc86fe3463ea91e9359c0e3611979a9d53e
2023-11-16 03:26:19 +08:00
hiyouga
f441932bd1 support full-parameter PPO
Former-commit-id: ce783036001397a20b0b4c5da2fea6d0c03389d2
2023-11-16 02:08:04 +08:00
hiyouga
06a4820836 disentangle model from tuner and rename modules
Former-commit-id: 4736344eb1595ee023a50d49e8118f4eee46305f
2023-11-15 16:29:09 +08:00