Commit Graph

16 Commits

Author SHA1 Message Date
hiyouga
a9d1fb72f7 refactor dataset_attr, add eos in pt, fix #757 2023-09-01 19:00:45 +08:00
codemayq
f7fdc088d4 add dataset stage check 2023-08-30 16:23:08 +08:00
hiyouga
57146c101f fix #649 2023-08-23 20:21:15 +08:00
hiyouga
9020524418 fix PPO trainer #551 , update readme 2023-08-18 11:43:10 +08:00
hiyouga
58f13e22da update training resuming 2023-08-18 01:41:17 +08:00
hiyouga
d125218cde support bf16 ppo #551 2023-08-18 00:40:32 +08:00
hiyouga
fa940c17b8 support rope scaling, fix #475 #476 #478 2023-08-12 20:46:27 +08:00
hiyouga
3ec4351cfd support DPO training (2305.18290) 2023-08-11 03:02:53 +08:00
hiyouga
d86ea314a1 support val set in streaming mode 2023-08-09 23:00:26 +08:00
hiyouga
ac88ce5233 fix RM save model 2023-08-01 11:56:17 +08:00
hiyouga
973a638665 release v0.1.4 2023-08-01 10:08:47 +08:00
hiyouga
9cb1f119a4 fix arg check 2023-07-31 23:48:57 +08:00
hiyouga
62dca5bb82 update readme 2023-07-31 23:42:32 +08:00
hiyouga
0411a4b3e1 support streaming data, fix #284 #274 #268 2023-07-31 23:33:00 +08:00
hiyouga
ed0e186a13 update web UI, support rm predict #210 2023-07-21 13:27:27 +08:00
hiyouga
f751376613 modity code structure 2023-07-15 16:54:28 +08:00