Commit Graph

14 Commits

Author SHA1 Message Date
hiyouga
3ec4351cfd support DPO training (2305.18290) 2023-08-11 03:02:53 +08:00
hiyouga
d86ea314a1 support val set in streaming mode 2023-08-09 23:00:26 +08:00
hiyouga
08f180e788 modify code structure 2023-08-02 23:17:36 +08:00
hiyouga
1d8a1878ea fix PPO trainer 2023-08-02 19:10:23 +08:00
hiyouga
b5ba87952a update ppo trainer 2023-08-02 18:46:41 +08:00
hiyouga
286f7be346 fix memory leak of PPO trainer 2023-08-02 17:41:34 +08:00
hiyouga
ac88ce5233 fix RM save model 2023-08-01 11:56:17 +08:00
hiyouga
0411a4b3e1 support streaming data, fix #284 #274 #268 2023-07-31 23:33:00 +08:00
hiyouga
29af67b015 fix API 2023-07-19 00:01:14 +08:00
hiyouga
12d8a8633f update webUI, fix #179 2023-07-18 15:35:17 +08:00
hiyouga
f8193e8009 release v0.1.0 2023-07-18 00:18:25 +08:00
hiyouga
85c2210452 fix #175 2023-07-17 18:07:17 +08:00
hiyouga
22d9a9c2af fix callback 2023-07-15 17:18:16 +08:00
hiyouga
f751376613 modity code structure 2023-07-15 16:54:28 +08:00