hiyouga
|
7ef169ed39
|
fix #4326
Former-commit-id: e2665e71c7
|
2024-06-17 18:17:48 +08:00 |
|
ancv
|
988231026a
|
update packing with sdpa and eager attention mode
Former-commit-id: 238f5c3d99
|
2024-06-16 02:25:47 +07:00 |
|
hiyouga
|
c0c6b8075a
|
tiny fix
Former-commit-id: 38b6b0f52e
|
2024-06-16 01:06:41 +08:00 |
|
ancv
|
9d9f8c6531
|
remove some unused params
Former-commit-id: 04315c3d92
|
2024-06-15 23:00:55 +07:00 |
|
hiyouga
|
2946153cea
|
add license
Former-commit-id: d87108daa6
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
833aa324c2
|
clean code
Former-commit-id: 2ed8270112
|
2024-06-13 01:58:16 +08:00 |
|
ancv
|
045eb155a2
|
implement efficient packing without cross-contamination attention
Former-commit-id: b2c367bc61
|
2024-06-12 11:56:01 +07:00 |
|
hiyouga
|
8c574eb3cb
|
fix deepspeed version
Former-commit-id: cca6f35108
|
2024-06-11 16:52:36 +08:00 |
|
hiyouga
|
e3baa5aa08
|
tiny fix
Former-commit-id: 3f24337a8a
|
2024-06-11 01:04:16 +08:00 |
|
hiyouga
|
2f164c2c41
|
fix #4160
The split heads should be concatenated in dim=2
Former-commit-id: a793e8456b
|
2024-06-11 00:37:17 +08:00 |
|
hiyouga
|
8da149ba40
|
rename files
Former-commit-id: 74f96efef9
|
2024-06-07 00:09:06 +08:00 |
|