| ancv | 9d9f8c6531 | remove some unused params; Former-commit-id: 04315c3d92ecc25537e45d5807cb38bc290dcb16 | 2024-06-15 23:00:55 +07:00 |
| hiyouga | 2946153cea | add license; Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56 | 2024-06-15 17:54:33 +08:00 |
| hiyouga | a3f4925c2c | add test cases; Former-commit-id: b27269bd2b52fb9d43cde8a8b7f293099b0127a2 | 2024-06-15 04:05:54 +08:00 |
| hiyouga | 833aa324c2 | clean code; Former-commit-id: 2ed8270112755971e3f2dfd2f29c5939b077330a | 2024-06-13 01:58:16 +08:00 |
| ancv | 045eb155a2 | implement efficient packing without cross-contamination attention; Former-commit-id: b2c367bc61c2778dc359613dca496d9e134c2743 | 2024-06-12 11:56:01 +07:00 |
| hiyouga | 8c574eb3cb | fix deepspeed version; Former-commit-id: cca6f351081903ca3b5f79f10accc1bbbae0ee61 | 2024-06-11 16:52:36 +08:00 |
| hiyouga | 5834651c4a | fix #4198; Former-commit-id: 89f2bd8c8c035181927bd530a7ffc733407d674c | 2024-06-11 15:38:38 +08:00 |
| hiyouga | e3baa5aa08 | tiny fix; Former-commit-id: 3f24337a8a995b145b1e8075bc23878eaa363844 | 2024-06-11 01:04:16 +08:00 |
| hiyouga | 2f164c2c41 | fix #4160; The split heads should be concatenated in dim=2; Former-commit-id: a793e8456b664ea0b48f0ba162999f18d06b4c2f | 2024-06-11 00:37:17 +08:00 |
| hiyouga | 3b244a69dc | fix #2666; Former-commit-id: c907d816670975daa900898660d3503708b7fc37 | 2024-06-10 21:24:15 +08:00 |
| hiyouga | 4f0ce9be4e | reorganize adapter code; Former-commit-id: 54cd743ebfbd296ae9eaf10c33f59e127f451785 | 2024-06-08 00:47:23 +08:00 |
| hoshi-hiyouga | bad35d1730 | fix #4139; Former-commit-id: cfd62283a9772fc854b852d2a1b71699f79a0048 | 2024-06-08 00:45:02 +08:00 |
| hiyouga | a8318723a4 | add resume args in webui; Former-commit-id: 06e5d136a4916413d1c116e341ba7d5136d7748a | 2024-06-08 00:22:16 +08:00 |
| hiyouga | 8da149ba40 | rename files; Former-commit-id: 74f96efef9bcd63f65d0190c901ff9be54ccd350 | 2024-06-07 00:09:06 +08:00 |
| hiyouga | 6cbc66a602 | fix torch gc; Former-commit-id: 451b6693c0cb86cc9ac03d1a9389cf1fd2b918ec | 2024-06-06 20:30:25 +08:00 |
| hiyouga | 3fcb678d00 | support train from scratch #4033 #4075; Former-commit-id: a12a506c3d2ba85975a5990c46d2e055cdfe0f2e | 2024-06-06 02:43:19 +08:00 |
| hiyouga | b88ecd71fd | fix full/freeze tuning for mllm; Former-commit-id: 08564838bd02651668845ed74e2e60561e5b6d8c | 2024-05-27 20:37:57 +08:00 |
| BUAADreamer | 606240aec0 | add regex of only tune lm and mm_proj; Former-commit-id: 57eb13b75d8597d748e84d3549a0b08876b669db | 2024-05-27 18:59:00 +08:00 |
| BUAADreamer | 3eaf371a22 | Merge branch 'hiyouga:main' into main; Former-commit-id: 60170a1da42a395cf440bbd3825c4e295c31ac38 | 2024-05-25 14:18:49 +08:00 |
| hiyouga | e5d2ef4434 | fix #3853; Former-commit-id: 063f91cc80193853d17c55fe092fb33683f5d39c | 2024-05-24 23:29:45 +08:00 |
| BUAADreamer | 071d674065 | support pretraining of llava; Former-commit-id: 29a6d5bdb8610be8f796eed65eede9ba7b503527 | 2024-05-21 08:57:14 +08:00 |
| hiyouga | 8d4a5ebf6e | fix zero2 high ram usage; Former-commit-id: 31a0564d4f4886db03250f2c6daee6e042dc3eb4 | 2024-05-19 21:53:54 +08:00 |
| hiyouga | df4aec7e72 | fix #3807; Former-commit-id: 1ebc890a5ff7b034c112bc9cf5cd8a6936613572 | 2024-05-19 17:07:57 +08:00 |
| hiyouga | 7130efff54 | fix jetmoe z3 block; Former-commit-id: d43822fcc220806b9eb7cbf9336ef42a0e6b2a51 | 2024-05-18 22:28:45 +08:00 |
| hiyouga | 780a1f5a4e | better dtype handle in loading; Former-commit-id: d9f190ff1ea1cc4dd061e8b03d429caea037bca4 | 2024-05-17 02:14:56 +08:00 |
| hiyouga | cae823ddf0 | rename package; Former-commit-id: 308edbc4260d45907b4a9d3a45ec21d83e48aacb | 2024-05-16 18:39:08 +08:00 |