Commit Graph

26 Commits

Author SHA1 Message Date
hiyouga
8c1046d78a support pissa 2024-06-16 01:08:12 +08:00
hiyouga
38b6b0f52e tiny fix 2024-06-16 01:06:41 +08:00
hiyouga
d87108daa6 add license 2024-06-15 17:54:33 +08:00
hiyouga
b27269bd2b add test cases 2024-06-15 04:05:54 +08:00
hiyouga
2ed8270112 clean code 2024-06-13 01:58:16 +08:00
hiyouga
cca6f35108 fix deepspeed version 2024-06-11 16:52:36 +08:00
hiyouga
89f2bd8c8c fix #4198 2024-06-11 15:38:38 +08:00
hiyouga
3f24337a8a tiny fix 2024-06-11 01:04:16 +08:00
hiyouga
a793e8456b fix #4160
The split heads should be concatenated in dim=2
2024-06-11 00:37:17 +08:00
hiyouga
c907d81667 fix #2666 2024-06-10 21:24:15 +08:00
hiyouga
54cd743ebf reorganize adapter code 2024-06-08 00:47:23 +08:00
hoshi-hiyouga
cfd62283a9 fix #4139 2024-06-08 00:45:02 +08:00
hiyouga
06e5d136a4 add resume args in webui 2024-06-08 00:22:16 +08:00
hiyouga
74f96efef9 rename files 2024-06-07 00:09:06 +08:00
hiyouga
451b6693c0 fix torch gc 2024-06-06 20:30:25 +08:00
hiyouga
a12a506c3d support train from scratch #4033 #4075 2024-06-06 02:43:19 +08:00
hiyouga
08564838bd fix full/freeze tuning for mllm 2024-05-27 20:37:57 +08:00
BUAADreamer
57eb13b75d add regex of only tune lm and mm_proj 2024-05-27 18:59:00 +08:00
BUAADreamer
60170a1da4 Merge branch 'hiyouga:main' into main 2024-05-25 14:18:49 +08:00
hiyouga
063f91cc80 fix #3853 2024-05-24 23:29:45 +08:00
BUAADreamer
29a6d5bdb8 support pretraining of llava 2024-05-21 08:57:14 +08:00
hiyouga
31a0564d4f fix zero2 high ram usage 2024-05-19 21:53:54 +08:00
hiyouga
1ebc890a5f fix #3807 2024-05-19 17:07:57 +08:00
hiyouga
d43822fcc2 fix jetmoe z3 block 2024-05-18 22:28:45 +08:00
hiyouga
d9f190ff1e better dtype handle in loading 2024-05-17 02:14:56 +08:00
hiyouga
308edbc426 rename package 2024-05-16 18:39:08 +08:00