Commit Graph

21 Commits

Author SHA1 Message Date
hiyouga
d8aa1404be support FlashAttention2 2023-09-10 20:43:56 +08:00
hiyouga
8ea32e4046 change to right-padding, update reward score #803 2023-09-08 20:04:31 +08:00
hiyouga
a9d1fb72f7 refactor dataset_attr, add eos in pt, fix #757 2023-09-01 19:00:45 +08:00
codemayq
ba94c8729d add stage in DatasetAttr 2023-08-23 20:54:53 +08:00
hiyouga
4318347d3f update template 2023-08-22 19:46:09 +08:00
hiyouga
53e33418d0 support ppo score norm (trl 0.5.1.dev required) 2023-08-18 12:02:42 +08:00
hiyouga
9020524418 fix PPO trainer #551 , update readme 2023-08-18 11:43:10 +08:00
hiyouga
7407d9daa1 fix system prompt 2023-08-16 01:35:52 +08:00
hiyouga
fa940c17b8 support rope scaling, fix #475 #476 #478 2023-08-12 20:46:27 +08:00
hiyouga
a48cb0d474 Release v0.1.6 2023-08-11 23:25:57 +08:00
hiyouga
3ec4351cfd support DPO training (2305.18290) 2023-08-11 03:02:53 +08:00
jiongxuc
3e000c2b60 huggingface login for projects must login while running 2023-08-10 14:57:12 +08:00
hiyouga
d86ea314a1 support val set in streaming mode 2023-08-09 23:00:26 +08:00
hiyouga
5453b93db0 update args spec 2023-08-07 15:23:35 +08:00
hiyouga
69744c17e8 support interleave probs 2023-08-04 21:27:35 +08:00
hiyouga
87f8f830e2 support Qwen-7B, fix InternLM-7B inference 2023-08-03 15:53:32 +08:00
hiyouga
0411a4b3e1 support streaming data, fix #284 #274 #268 2023-07-31 23:33:00 +08:00
hiyouga
513e1f1ec9 Update data_args.py 2023-07-28 17:42:41 +08:00
hiyouga
8f7819fcaa fix #194 2023-07-19 17:07:33 +08:00
hiyouga
657cf0f55a create chat model 2023-07-15 19:26:20 +08:00
hiyouga
f751376613 modity code structure 2023-07-15 16:54:28 +08:00