hiyouga
|
a9fc7dbfa6
|
support function calling
Former-commit-id: 66533b3f65babf2429c92c0f8fafe4eff5e0ff63
|
2024-01-18 09:54:23 +08:00 |
|
hiyouga
|
5a206d54c9
|
disentangle model from tuner and rename modules
Former-commit-id: 02cbf91e7e424f8379c1fed01b82a5f7a83b6947
|
2023-11-15 16:29:09 +08:00 |
|
hiyouga
|
d88c72f326
|
fix flashattn warning
Former-commit-id: 6eb095d39bd82fdbdb729a0ea57fc7246e3a60d6
|
2023-11-10 18:34:54 +08:00 |
|
hiyouga
|
f1a8fcf917
|
refactor model_dtype, fix PPO trainer
Former-commit-id: 3e17ee5afbcb823a7c9a2f91864b3750cd79edb4
|
2023-10-11 23:16:01 +08:00 |
|
hiyouga
|
7bdcd9d507
|
fix flash shift short attention
Former-commit-id: e44ad23eafa39b3ac0400b6f97cd440106a87f44
|
2023-10-09 17:54:48 +08:00 |
|
hiyouga
|
0c1e00574d
|
fix shift short attention
Former-commit-id: 9a49cce8e6f6b222f74a07bdab40efee6a77b0f1
|
2023-10-09 17:07:46 +08:00 |
|
hiyouga
|
51c6c09f02
|
tiny fix
Former-commit-id: 35b355b76d2a8f8adf3750a905224e52d03d218f
|
2023-09-28 01:03:04 +08:00 |
|
hiyouga
|
d231f97335
|
tiny fix
Former-commit-id: 7451b2ae7e58d0f1857f01a037672a8c53b1bd0d
|
2023-09-28 01:02:11 +08:00 |
|
hiyouga
|
2bacc9789a
|
fix #1064
Former-commit-id: fd4660aa72d981d7efdad465f24a59358626c975
|
2023-09-28 00:53:29 +08:00 |
|
hiyouga
|
4617413bde
|
fix layer norm dtype
Former-commit-id: 67af21961b68d9b54d07b09e444c7140869f26da
|
2023-09-28 00:25:55 +08:00 |
|
hiyouga
|
d6f5a3cae9
|
support LongLoRA
Former-commit-id: 0832ed37e7947d699f17375648a52f80752c2b6b
|
2023-09-27 21:55:50 +08:00 |
|