Commit Graph

62 Commits

Author SHA1 Message Date
hiyouga
5156114981 fix #4357
Former-commit-id: 4bd77d8563
2024-06-18 22:42:45 +08:00
hiyouga
7ef169ed39 fix #4326
Former-commit-id: e2665e71c7
2024-06-17 18:17:48 +08:00
ancv
988231026a update packing with sdpa and eager attention mode
Former-commit-id: 238f5c3d99
2024-06-16 02:25:47 +07:00
hiyouga
c0c6b8075a tiny fix
Former-commit-id: 38b6b0f52e
2024-06-16 01:06:41 +08:00
ancv
9d9f8c6531 remove some unused params
Former-commit-id: 04315c3d92
2024-06-15 23:00:55 +07:00
hiyouga
2946153cea add license
Former-commit-id: d87108daa6
2024-06-15 17:54:33 +08:00
hiyouga
833aa324c2 clean code
Former-commit-id: 2ed8270112
2024-06-13 01:58:16 +08:00
ancv
045eb155a2 implement efficient packing without cross-contamination attention
Former-commit-id: b2c367bc61
2024-06-12 11:56:01 +07:00
hiyouga
8c574eb3cb fix deepspeed version
Former-commit-id: cca6f35108
2024-06-11 16:52:36 +08:00
hiyouga
e3baa5aa08 tiny fix
Former-commit-id: 3f24337a8a
2024-06-11 01:04:16 +08:00
hiyouga
2f164c2c41 fix #4160
The split heads should be concatenated in dim=2


Former-commit-id: a793e8456b
2024-06-11 00:37:17 +08:00
hiyouga
8da149ba40 rename files
Former-commit-id: 74f96efef9
2024-06-07 00:09:06 +08:00