Commit Graph

24 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| hiyouga | eb2db2af8e | fix #5611 | 2024-10-06 10:33:11 +08:00 |
| hiyouga | fe7ffccdb9 | fix #5542 | 2024-09-30 23:28:55 +08:00 |
| hiyouga | 45841bb646 | add patch processor func | 2024-09-30 17:07:43 +08:00 |
| hiyouga | a025c3df61 | remove visual_inputs, fix qlora | 2024-08-31 00:24:51 +08:00 |
| hiyouga | 72bc8f0111 | support liger kernel | 2024-08-27 11:20:14 +08:00 |
| hiyouga | 5665062ca0 | tiny fix | 2024-07-22 21:10:15 +08:00 |
| hoshi-hiyouga | 26082fc6c9 | fix #4917 | 2024-07-22 11:28:31 +08:00 |
| hiyouga | 2f6af73da2 | fix gemma2 attention | 2024-07-13 23:33:45 +08:00 |
| hiyouga | 8a6a7b9c8a | update arg name | 2024-07-03 23:23:24 +08:00 |
| ancv | e8e13b0942 | move efficient_packing from data_args to model_args | 2024-07-02 18:37:55 +07:00 |
| hoshi-hiyouga | e8e6af2651 | Merge branch 'main' into main | 2024-07-01 21:01:09 +08:00 |
| hiyouga | 4d35e218b1 | bf16 by default, gemma2 attns (Gemma2 finetuning cannot work until merging https://github.com/huggingface/transformers/pull/31674) | 2024-06-28 06:00:26 +08:00 |
| hiyouga | fca893d73c | fix #4410 | 2024-06-24 22:34:31 +08:00 |
| ancv | 770f75dc83 | move configure_packing to llamafactory.model.patcher and fix constants | 2024-06-21 00:45:06 +07:00 |
| hiyouga | 8d4f5093cf | tiny fix | 2024-06-20 22:56:05 +08:00 |
| hiyouga | 3b040e8e0f | update patcher | 2024-06-19 21:27:00 +08:00 |
| hiyouga | e2665e71c7 | fix #4326 | 2024-06-17 18:17:48 +08:00 |
| hiyouga | d87108daa6 | add license | 2024-06-15 17:54:33 +08:00 |
| hiyouga | b27269bd2b | add test cases | 2024-06-15 04:05:54 +08:00 |
| hiyouga | 89f2bd8c8c | fix #4198 | 2024-06-11 15:38:38 +08:00 |
| hiyouga | 74f96efef9 | rename files | 2024-06-07 00:09:06 +08:00 |
| hiyouga | 31a0564d4f | fix zero2 high ram usage | 2024-05-19 21:53:54 +08:00 |
| hiyouga | d9f190ff1e | better dtype handle in loading | 2024-05-17 02:14:56 +08:00 |
| hiyouga | 308edbc426 | rename package | 2024-05-16 18:39:08 +08:00 |