stceum
|
3ed063f281
|
Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
|
2024-06-24 20:39:31 +08:00 |
|
ancv
|
770f75dc83
|
move configure_packing to llamafactory.model.patcher and fix constants
|
2024-06-21 00:45:06 +07:00 |
|
hiyouga
|
3b040e8e0f
|
update patcher
|
2024-06-19 21:27:00 +08:00 |
|
hiyouga
|
4bd77d8563
|
fix #4357
|
2024-06-18 22:42:45 +08:00 |
|
hiyouga
|
e2665e71c7
|
fix #4326
|
2024-06-17 18:17:48 +08:00 |
|
ancv
|
238f5c3d99
|
update packing with sdpa and eager attention mode
|
2024-06-16 02:25:47 +07:00 |
|
hiyouga
|
38b6b0f52e
|
tiny fix
|
2024-06-16 01:06:41 +08:00 |
|
ancv
|
04315c3d92
|
remove some unused params
|
2024-06-15 23:00:55 +07:00 |
|
hiyouga
|
d87108daa6
|
add license
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
2ed8270112
|
clean code
|
2024-06-13 01:58:16 +08:00 |
|
ancv
|
b2c367bc61
|
implement efficient packing without cross-contamination attention
|
2024-06-12 11:56:01 +07:00 |
|
hiyouga
|
cca6f35108
|
fix deepspeed version
|
2024-06-11 16:52:36 +08:00 |
|
hiyouga
|
3f24337a8a
|
tiny fix
|
2024-06-11 01:04:16 +08:00 |
|
hiyouga
|
a793e8456b
|
fix #4160
The split heads should be concatenated in dim=2
|
2024-06-11 00:37:17 +08:00 |
|
hiyouga
|
74f96efef9
|
rename files
|
2024-06-07 00:09:06 +08:00 |
|