Commit Graph

13 Commits

Author SHA1 Message Date
hiyouga
af18b0dce7 fix #1184 2023-10-14 19:20:11 +08:00
hiyouga
2818af0b09 refactor model_dtype, fix PPO trainer 2023-10-11 23:16:01 +08:00
hiyouga
84b7486885 fix layer norm dtype 2023-09-28 00:25:55 +08:00
hiyouga
90375f600d support LongLoRA 2023-09-27 21:55:50 +08:00
hiyouga
338b8664ed fix #944 2023-09-21 19:51:02 +08:00
hiyouga
d8aa1404be support FlashAttention2 2023-09-10 20:43:56 +08:00
hiyouga
8ea32e4046 change to right-padding, update reward score #803 2023-09-08 20:04:31 +08:00
hiyouga
a9d1fb72f7 refactor dataset_attr, add eos in pt, fix #757 2023-09-01 19:00:45 +08:00
hiyouga
fa940c17b8 support rope scaling, fix #475 #476 #478 2023-08-12 20:46:27 +08:00
hiyouga
a48cb0d474 Release v0.1.6 2023-08-11 23:25:57 +08:00
hiyouga
3ec4351cfd support DPO training (2305.18290) 2023-08-11 03:02:53 +08:00
jiongxuc
3e000c2b60 huggingface login for projects must login while running 2023-08-10 14:57:12 +08:00
hiyouga
f751376613 modity code structure 2023-07-15 16:54:28 +08:00