Commit Graph

110 Commits

Author SHA1 Message Date
hiyouga
8ed6b367e2 fix #4549 2024-06-28 00:41:58 +08:00
hiyouga
e44a4f07f0 tiny fix 2024-06-27 20:14:48 +08:00
hiyouga
ad144c2265 support HQQ/EETQ #4113 2024-06-27 00:29:42 +08:00
hiyouga
555ca8d780 lint 2024-06-25 02:55:50 +08:00
hiyouga
095fab58d3 tiny fix about badam 2024-06-25 01:54:53 +08:00
hoshi-hiyouga
d0f953bf5b Merge pull request #4352 from Ledzy/main: [Enhancement] Support ZeRO-3 when using BAdam 2024-06-25 01:49:13 +08:00
hiyouga
41086059b1 tiny fix 2024-06-25 01:15:19 +08:00
hoshi-hiyouga
def6d280db Merge pull request #4417 from mMrBun/main: Add tool_format parameter to rewrite templates for different function call formats. 2024-06-24 23:17:55 +08:00
hiyouga
fca893d73c fix #4410 2024-06-24 22:34:31 +08:00
hoshi-hiyouga
e90c424f55 Update parser.py 2024-06-24 21:37:42 +08:00
stceum
3ed063f281 Bug fix: "off" is parsed as False in YAML files; changed to "disabled" to avoid this. 2024-06-24 20:39:31 +08:00
mMrBun
20e2e6fdcb Add tool_format to overwrite tool formatter template 2024-06-22 02:13:23 +08:00
ancv
770f75dc83 move configure_packing to llamafactory.model.patcher and fix constants 2024-06-21 00:45:06 +07:00
hiyouga
8d4f5093cf tiny fix 2024-06-20 22:56:05 +08:00
Jonery
5c2ff1b749 Cleaner integration. 2024-06-19 12:29:40 +08:00
hiyouga
4bd77d8563 fix #4357 2024-06-18 22:42:45 +08:00
Jonery
8f7c78b641 fix typo 2024-06-18 12:39:26 +08:00
Jonery
0f72aac8c9 Support distributed BAdam. 2024-06-18 12:27:47 +08:00
hiyouga
24c160df3d lint 2024-06-17 22:35:56 +08:00
Jonery
ea1f3ba5e0 Merge remote-tracking branch 'upstream/main' 2024-06-17 18:44:51 +08:00
Jonery
33b4372778 adapt for badam with ds zero3 2024-06-17 18:18:10 +08:00
hoshi-hiyouga
29c1f31baa Update parser.py 2024-06-16 02:57:00 +08:00
hiyouga
8c1046d78a support pissa 2024-06-16 01:08:12 +08:00
hiyouga
38b6b0f52e tiny fix 2024-06-16 01:06:41 +08:00
hiyouga
80a9e6bf94 use fixture 2024-06-15 20:06:17 +08:00
hiyouga
d87108daa6 add license 2024-06-15 17:54:33 +08:00
hiyouga
d519b4d76d disable DP 2024-06-15 04:57:19 +08:00
hiyouga
b27269bd2b add test cases 2024-06-15 04:05:54 +08:00
hiyouga
713fde4259 fix lint 2024-06-13 00:48:44 +08:00
ancv
b2c367bc61 implement efficient packing without cross-contamination attention 2024-06-12 11:56:01 +07:00
hiyouga
89f2bd8c8c fix #4198 2024-06-11 15:38:38 +08:00
hiyouga
90e14a960d tiny fix 2024-06-11 12:48:53 +08:00
hiyouga
54cd743ebf reorganize adapter code 2024-06-08 00:47:23 +08:00
hiyouga
cae4737907 lora modules: all by default 2024-06-06 03:53:28 +08:00
hoshi-hiyouga
ca459f67eb Merge pull request #4080 from MengqingCao/npu: Add npu option for model exporting 2024-06-06 03:15:44 +08:00
hoshi-hiyouga
af2c3cbee4 Update model_args.py 2024-06-06 03:14:23 +08:00
hiyouga
8fcc79e1e6 add vllm_dtype arg #3387 #3717 2024-06-06 02:53:27 +08:00
hiyouga
a12a506c3d support train from scratch #4033 #4075 2024-06-06 02:43:19 +08:00
MengqingCao
07045c876a add npu for model export 2024-06-05 07:06:40 +00:00
hiyouga
eed33862bc fix #4005 #4013 2024-06-03 19:12:29 +08:00
hoshi-hiyouga
1539c72b94 Merge pull request #4007 from xu-song/patch-3: Update model_args.py 2024-06-03 18:54:37 +08:00
hiyouga
24e1c0e2ee fix #4022 2024-06-03 18:38:36 +08:00
Xu Song
dade2f083d Update model_args.py 2024-05-31 14:35:48 +08:00
hiyouga
8070871732 better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui
2024-05-29 23:55:38 +08:00
hiyouga
1e80a3a638 bump vllm version to 0.4.1 2024-05-28 21:27:27 +08:00
hiyouga
7c016b22aa support DDP in webui 2024-05-28 19:24:22 +08:00
hiyouga
08564838bd fix full/freeze tuning for mllm 2024-05-27 20:37:57 +08:00
BUAADreamer
4bc7c10c00 Merge branch 'hiyouga:main' into main 2024-05-27 11:54:01 +08:00
hiyouga
cb63b32986 support SimPO #3900 2024-05-26 23:46:33 +08:00
BUAADreamer
047a06a1e5 Merge branch 'hiyouga:main' into main 2024-05-24 09:50:00 +08:00