Commit Graph

16 Commits

Author SHA1 Message Date
hiyouga
aee634cd20 fix #3077 2024-04-01 21:35:18 +08:00
hiyouga
a1c8c98c5f fix #2941 2024-03-24 00:28:44 +08:00
hiyouga
638234ceee format style 2024-01-20 20:15:56 +08:00
hiyouga
f6d6e00337 fix tests 2024-01-20 19:58:04 +08:00
hiyouga
38af076a75 support longlora for main branch 2024-01-20 19:25:22 +08:00
hiyouga
d9f1cae351 support function calling 2024-01-18 09:54:23 +08:00
hiyouga
4736344eb1 disentangle model from tuner and rename modules 2023-11-15 16:29:09 +08:00
hiyouga
4bd8e3906d fix flashattn warning 2023-11-10 18:34:54 +08:00
hiyouga
2818af0b09 refactor model_dtype, fix PPO trainer 2023-10-11 23:16:01 +08:00
hiyouga
0a356bc897 fix flash shift short attention 2023-10-09 17:54:48 +08:00
hiyouga
ab65c3063b fix shift short attention 2023-10-09 17:07:46 +08:00
hiyouga
5d4118b096 tiny fix 2023-09-28 01:03:04 +08:00
hiyouga
d2ebd225db tiny fix 2023-09-28 01:02:11 +08:00
hiyouga
c902236397 fix #1064 2023-09-28 00:53:29 +08:00
hiyouga
84b7486885 fix layer norm dtype 2023-09-28 00:25:55 +08:00
hiyouga
90375f600d support LongLoRA 2023-09-27 21:55:50 +08:00