Commit Graph

61 Commits

Author SHA1 Message Date
hiyouga
ca7b65439d fix #4402 #4617
Deprecate reserved_label_len arg


Former-commit-id: 1771251ce3
2024-07-01 01:19:27 +08:00
hiyouga
b0acd27114 increase pissa_iter for stability
Former-commit-id: 64f4337dac
2024-06-28 03:18:54 +08:00
hiyouga
835f0578c2 refactor pissa, improve llamaboard
Former-commit-id: 8baf3b22b0
2024-06-28 01:04:24 +08:00
hiyouga
a294ef2fae fix #4549
Former-commit-id: 8ed6b367e2
2024-06-28 00:41:58 +08:00
hiyouga
7c488cea57 tiny fix
Former-commit-id: e44a4f07f0
2024-06-27 20:14:48 +08:00
hiyouga
d2d9fa4abb support HQQ/EETQ #4113
Former-commit-id: ad144c2265
2024-06-27 00:29:42 +08:00
hiyouga
f3f25ae3b7 lint
Former-commit-id: 555ca8d780
2024-06-25 02:55:50 +08:00
hiyouga
a225b5a70c tiny fix about badam
Former-commit-id: 095fab58d3
2024-06-25 01:54:53 +08:00
hoshi-hiyouga
fe6ef6400c Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam

Former-commit-id: d0f953bf5b
2024-06-25 01:49:13 +08:00
hiyouga
d519c2fde5 tiny fix
Former-commit-id: 41086059b1
2024-06-25 01:15:19 +08:00
hoshi-hiyouga
709bbc1d92 Merge pull request #4417 from mMrBun/main
Add tool_format parameter to rewrite templates for different function call formats.

Former-commit-id: def6d280db
2024-06-24 23:17:55 +08:00
hiyouga
47651a94a3 fix #4410
Former-commit-id: fca893d73c
2024-06-24 22:34:31 +08:00
hoshi-hiyouga
e74fcdf7b1 Update parser.py
Former-commit-id: e90c424f55
2024-06-24 21:37:42 +08:00
stceum
9aa640f27b Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
Former-commit-id: 3ed063f281
2024-06-24 20:39:31 +08:00
mMrBun
c0e005e2ea Add tool_format to overwrite tool formatter template
Former-commit-id: 20e2e6fdcb
2024-06-22 02:13:23 +08:00
hiyouga
0844750bb9 tiny fix
Former-commit-id: 8d4f5093cf
2024-06-20 22:56:05 +08:00
Jonery
c779899f7b Cleaner integration.
Former-commit-id: 5c2ff1b749
2024-06-19 12:29:40 +08:00
hiyouga
5156114981 fix #4357
Former-commit-id: 4bd77d8563
2024-06-18 22:42:45 +08:00
Jonery
c2734108e7 fix typo
Former-commit-id: 8f7c78b641
2024-06-18 12:39:26 +08:00
Jonery
3a5eacb4cf Support distributed BAdam.
Former-commit-id: 0f72aac8c9
2024-06-18 12:27:47 +08:00
hiyouga
19bf21efba lint
Former-commit-id: 24c160df3d
2024-06-17 22:35:56 +08:00
Jonery
5d59f6562a Merge remote-tracking branch 'upstream/main'
Former-commit-id: ea1f3ba5e0
2024-06-17 18:44:51 +08:00
Jonery
756566342d adapt for badam with ds zero3
Former-commit-id: 33b4372778
2024-06-17 18:18:10 +08:00
hoshi-hiyouga
06bbc29614 Update parser.py
Former-commit-id: 29c1f31baa
2024-06-16 02:57:00 +08:00
hiyouga
f25b8626bf support pissa
Former-commit-id: 8c1046d78a
2024-06-16 01:08:12 +08:00
hiyouga
c0c6b8075a tiny fix
Former-commit-id: 38b6b0f52e
2024-06-16 01:06:41 +08:00
hiyouga
96b82ccd4d use fixture
Former-commit-id: 80a9e6bf94
2024-06-15 20:06:17 +08:00
hiyouga
2946153cea add license
Former-commit-id: d87108daa6
2024-06-15 17:54:33 +08:00
hiyouga
fcbfa70c19 disable DP
Former-commit-id: d519b4d76d
2024-06-15 04:57:19 +08:00
hiyouga
a3f4925c2c add test cases
Former-commit-id: b27269bd2b
2024-06-15 04:05:54 +08:00
hiyouga
99ce085415 fix lint
Former-commit-id: 713fde4259
2024-06-13 00:48:44 +08:00
hiyouga
5834651c4a fix #4198
Former-commit-id: 89f2bd8c8c
2024-06-11 15:38:38 +08:00
hiyouga
53de7f7cc3 tiny fix
Former-commit-id: 90e14a960d
2024-06-11 12:48:53 +08:00
hiyouga
4f0ce9be4e reorganize adapter code
Former-commit-id: 54cd743ebf
2024-06-08 00:47:23 +08:00
hiyouga
cceff9f520 lora modules: all by default
Former-commit-id: cae4737907
2024-06-06 03:53:28 +08:00
hoshi-hiyouga
d31c9c73c7 Merge pull request #4080 from MengqingCao/npu
Add npu option for model exporting

Former-commit-id: ca459f67eb
2024-06-06 03:15:44 +08:00
hoshi-hiyouga
d9a372658a Update model_args.py
Former-commit-id: af2c3cbee4
2024-06-06 03:14:23 +08:00
hiyouga
c439c959f7 add vllm_dtype arg #3387 #3717
Former-commit-id: 8fcc79e1e6
2024-06-06 02:53:27 +08:00
hiyouga
3fcb678d00 support train from scratch #4033 #4075
Former-commit-id: a12a506c3d
2024-06-06 02:43:19 +08:00
MengqingCao
15f6ab73a5 add npu for model export
Former-commit-id: 07045c876a
2024-06-05 07:06:40 +00:00
hiyouga
e4ce59243b fix #4005 #4013
Former-commit-id: eed33862bc
2024-06-03 19:12:29 +08:00
hoshi-hiyouga
eaab09fccb Merge pull request #4007 from xu-song/patch-3
Update model_args.py

Former-commit-id: 1539c72b94
2024-06-03 18:54:37 +08:00
hiyouga
d0ceb1b091 fix #4022
Former-commit-id: 24e1c0e2ee
2024-06-03 18:38:36 +08:00
Xu Song
abe33220bf Update model_args.py
Former-commit-id: dade2f083d
2024-05-31 14:35:48 +08:00
hiyouga
820404946e better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui


Former-commit-id: 8070871732
2024-05-29 23:55:38 +08:00
hiyouga
87e71df597 bump vllm version to 0.4.1
Former-commit-id: 1e80a3a638
2024-05-28 21:27:27 +08:00
hiyouga
3ea8f5e6b9 support DDP in webui
Former-commit-id: 7c016b22aa
2024-05-28 19:24:22 +08:00
hiyouga
b88ecd71fd fix full/freeze tuning for mllm
Former-commit-id: 08564838bd
2024-05-27 20:37:57 +08:00
BUAADreamer
a6c2a2071d Merge branch 'hiyouga:main' into main
Former-commit-id: 4bc7c10c00
2024-05-27 11:54:01 +08:00
hiyouga
4807c11db8 support SimPO #3900
Former-commit-id: cb63b32986
2024-05-26 23:46:33 +08:00