65 Commits

Author SHA1 Message Date
stceum
9aa640f27b Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
Former-commit-id: 3ed063f281d1c2563df1b9eb3800543208c9dc16
2024-06-24 20:39:31 +08:00
ancv
5319447aa5 move configure_packing to llamafactory.model.patcher and fix constants
Former-commit-id: 770f75dc8363bfa284a72159ff8ad25ec9abe4e0
2024-06-21 00:45:06 +07:00
hiyouga
030b4811c7 update patcher
Former-commit-id: 3b040e8e0f78dbb6bc1409a1b2b788e1affc7458
2024-06-19 21:27:00 +08:00
hiyouga
5156114981 fix #4357
Former-commit-id: 4bd77d8563aa85230af65caf901214247e214bed
2024-06-18 22:42:45 +08:00
hiyouga
7ef169ed39 fix #4326
Former-commit-id: e2665e71c7428014d46d91542b01a58c1064d05a
2024-06-17 18:17:48 +08:00
ancv
988231026a update packing with sdpa and eager attention mode
Former-commit-id: 238f5c3d99809c6ae2571b59bdce8d8ea3c700b9
2024-06-16 02:25:47 +07:00
hiyouga
c0c6b8075a tiny fix
Former-commit-id: 38b6b0f52edeb8ba45aa03b415b3c0c1b0e0c1e4
2024-06-16 01:06:41 +08:00
ancv
9d9f8c6531 remove some unused params
Former-commit-id: 04315c3d92ecc25537e45d5807cb38bc290dcb16
2024-06-15 23:00:55 +07:00
hiyouga
2946153cea add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
2024-06-15 17:54:33 +08:00
hiyouga
833aa324c2 clean code
Former-commit-id: 2ed8270112755971e3f2dfd2f29c5939b077330a
2024-06-13 01:58:16 +08:00
ancv
045eb155a2 implement efficient packing without cross-contamination attention
Former-commit-id: b2c367bc61c2778dc359613dca496d9e134c2743
2024-06-12 11:56:01 +07:00
hiyouga
8c574eb3cb fix deepspeed version
Former-commit-id: cca6f351081903ca3b5f79f10accc1bbbae0ee61
2024-06-11 16:52:36 +08:00
hiyouga
e3baa5aa08 tiny fix
Former-commit-id: 3f24337a8a995b145b1e8075bc23878eaa363844
2024-06-11 01:04:16 +08:00
hiyouga
2f164c2c41 fix #4160
The split heads should be concatenated in dim=2


Former-commit-id: a793e8456b664ea0b48f0ba162999f18d06b4c2f
2024-06-11 00:37:17 +08:00
hiyouga
8da149ba40 rename files
Former-commit-id: 74f96efef9bcd63f65d0190c901ff9be54ccd350
2024-06-07 00:09:06 +08:00