176 Commits

Author SHA1 Message Date
ancv
9d9f8c6531 remove some unused params
Former-commit-id: 04315c3d92ecc25537e45d5807cb38bc290dcb16
2024-06-15 23:00:55 +07:00
hiyouga
2946153cea add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
2024-06-15 17:54:33 +08:00
hiyouga
a3f4925c2c add test cases
Former-commit-id: b27269bd2b52fb9d43cde8a8b7f293099b0127a2
2024-06-15 04:05:54 +08:00
hiyouga
833aa324c2 clean code
Former-commit-id: 2ed8270112755971e3f2dfd2f29c5939b077330a
2024-06-13 01:58:16 +08:00
ancv
045eb155a2 implement efficient packing without cross-contamination attention
Former-commit-id: b2c367bc61c2778dc359613dca496d9e134c2743
2024-06-12 11:56:01 +07:00
hiyouga
8c574eb3cb fix deepspeed version
Former-commit-id: cca6f351081903ca3b5f79f10accc1bbbae0ee61
2024-06-11 16:52:36 +08:00
hiyouga
5834651c4a fix #4198
Former-commit-id: 89f2bd8c8c035181927bd530a7ffc733407d674c
2024-06-11 15:38:38 +08:00
hiyouga
e3baa5aa08 tiny fix
Former-commit-id: 3f24337a8a995b145b1e8075bc23878eaa363844
2024-06-11 01:04:16 +08:00
hiyouga
2f164c2c41 fix #4160
The split heads should be concatenated in dim=2


Former-commit-id: a793e8456b664ea0b48f0ba162999f18d06b4c2f
2024-06-11 00:37:17 +08:00
hiyouga
3b244a69dc fix #2666
Former-commit-id: c907d816670975daa900898660d3503708b7fc37
2024-06-10 21:24:15 +08:00
hiyouga
4f0ce9be4e reorganize adapter code
Former-commit-id: 54cd743ebfbd296ae9eaf10c33f59e127f451785
2024-06-08 00:47:23 +08:00
hoshi-hiyouga
bad35d1730 fix #4139
Former-commit-id: cfd62283a9772fc854b852d2a1b71699f79a0048
2024-06-08 00:45:02 +08:00
hiyouga
a8318723a4 add resume args in webui
Former-commit-id: 06e5d136a4916413d1c116e341ba7d5136d7748a
2024-06-08 00:22:16 +08:00
hiyouga
8da149ba40 rename files
Former-commit-id: 74f96efef9bcd63f65d0190c901ff9be54ccd350
2024-06-07 00:09:06 +08:00
hiyouga
6cbc66a602 fix torch gc
Former-commit-id: 451b6693c0cb86cc9ac03d1a9389cf1fd2b918ec
2024-06-06 20:30:25 +08:00
hiyouga
3fcb678d00 support train from scratch #4033 #4075
Former-commit-id: a12a506c3d2ba85975a5990c46d2e055cdfe0f2e
2024-06-06 02:43:19 +08:00
hiyouga
b88ecd71fd fix full/freeze tuning for mllm
Former-commit-id: 08564838bd02651668845ed74e2e60561e5b6d8c
2024-05-27 20:37:57 +08:00
BUAADreamer
606240aec0 add regex of only tune lm and mm_proj
Former-commit-id: 57eb13b75d8597d748e84d3549a0b08876b669db
2024-05-27 18:59:00 +08:00
BUAADreamer
3eaf371a22 Merge branch 'hiyouga:main' into main
Former-commit-id: 60170a1da42a395cf440bbd3825c4e295c31ac38
2024-05-25 14:18:49 +08:00
hiyouga
e5d2ef4434 fix #3853
Former-commit-id: 063f91cc80193853d17c55fe092fb33683f5d39c
2024-05-24 23:29:45 +08:00
BUAADreamer
071d674065 support pretraining of llava
Former-commit-id: 29a6d5bdb8610be8f796eed65eede9ba7b503527
2024-05-21 08:57:14 +08:00
hiyouga
8d4a5ebf6e fix zero2 high ram usage
Former-commit-id: 31a0564d4f4886db03250f2c6daee6e042dc3eb4
2024-05-19 21:53:54 +08:00
hiyouga
df4aec7e72 fix #3807
Former-commit-id: 1ebc890a5ff7b034c112bc9cf5cd8a6936613572
2024-05-19 17:07:57 +08:00
hiyouga
7130efff54 fix jetmoe z3 block
Former-commit-id: d43822fcc220806b9eb7cbf9336ef42a0e6b2a51
2024-05-18 22:28:45 +08:00
hiyouga
780a1f5a4e better dtype handle in loading
Former-commit-id: d9f190ff1ea1cc4dd061e8b03d429caea037bca4
2024-05-17 02:14:56 +08:00
hiyouga
cae823ddf0 rename package
Former-commit-id: 308edbc4260d45907b4a9d3a45ec21d83e48aacb
2024-05-16 18:39:08 +08:00