hiyouga
|
ca7b65439d
|
fix #4402 #4617
Deprecate reserved_label_len arg
Former-commit-id: 1771251ce3
|
2024-07-01 01:19:27 +08:00 |
|
hiyouga
|
b0acd27114
|
increase pissa_iter for stability
Former-commit-id: 64f4337dac
|
2024-06-28 03:18:54 +08:00 |
|
hiyouga
|
835f0578c2
|
refactor pissa, improve llamaboard
Former-commit-id: 8baf3b22b0
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
a294ef2fae
|
fix #4549
Former-commit-id: 8ed6b367e2
|
2024-06-28 00:41:58 +08:00 |
|
hiyouga
|
7c488cea57
|
tiny fix
Former-commit-id: e44a4f07f0
|
2024-06-27 20:14:48 +08:00 |
|
hiyouga
|
d2d9fa4abb
|
support HQQ/EETQ #4113
Former-commit-id: ad144c2265
|
2024-06-27 00:29:42 +08:00 |
|
hiyouga
|
f3f25ae3b7
|
lint
Former-commit-id: 555ca8d780
|
2024-06-25 02:55:50 +08:00 |
|
hiyouga
|
a225b5a70c
|
tiny fix about badam
Former-commit-id: 095fab58d3
|
2024-06-25 01:54:53 +08:00 |
|
hoshi-hiyouga
|
fe6ef6400c
|
Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
Former-commit-id: d0f953bf5b
|
2024-06-25 01:49:13 +08:00 |
|
hiyouga
|
d519c2fde5
|
tiny fix
Former-commit-id: 41086059b1
|
2024-06-25 01:15:19 +08:00 |
|
hoshi-hiyouga
|
709bbc1d92
|
Merge pull request #4417 from mMrBun/main
Add tool_format parameter to rewrite templates for different function call formats.
Former-commit-id: def6d280db
|
2024-06-24 23:17:55 +08:00 |
|
hiyouga
|
47651a94a3
|
fix #4410
Former-commit-id: fca893d73c
|
2024-06-24 22:34:31 +08:00 |
|
hoshi-hiyouga
|
e74fcdf7b1
|
Update parser.py
Former-commit-id: e90c424f55
|
2024-06-24 21:37:42 +08:00 |
|
stceum
|
9aa640f27b
|
Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
Former-commit-id: 3ed063f281
|
2024-06-24 20:39:31 +08:00 |
|
mMrBun
|
c0e005e2ea
|
Add tool_format to overwrite tool formatter template
Former-commit-id: 20e2e6fdcb
|
2024-06-22 02:13:23 +08:00 |
|
hiyouga
|
0844750bb9
|
tiny fix
Former-commit-id: 8d4f5093cf
|
2024-06-20 22:56:05 +08:00 |
|
Jonery
|
c779899f7b
|
Cleaner integration.
Former-commit-id: 5c2ff1b749
|
2024-06-19 12:29:40 +08:00 |
|
hiyouga
|
5156114981
|
fix #4357
Former-commit-id: 4bd77d8563
|
2024-06-18 22:42:45 +08:00 |
|
Jonery
|
c2734108e7
|
fix typo
Former-commit-id: 8f7c78b641
|
2024-06-18 12:39:26 +08:00 |
|
Jonery
|
3a5eacb4cf
|
Support distributed BAdam.
Former-commit-id: 0f72aac8c9
|
2024-06-18 12:27:47 +08:00 |
|
hiyouga
|
19bf21efba
|
lint
Former-commit-id: 24c160df3d
|
2024-06-17 22:35:56 +08:00 |
|
Jonery
|
5d59f6562a
|
Merge remote-tracking branch 'upstream/main'
Former-commit-id: ea1f3ba5e0
|
2024-06-17 18:44:51 +08:00 |
|
Jonery
|
756566342d
|
adapt for badam with ds zero3
Former-commit-id: 33b4372778
|
2024-06-17 18:18:10 +08:00 |
|
hoshi-hiyouga
|
06bbc29614
|
Update parser.py
Former-commit-id: 29c1f31baa
|
2024-06-16 02:57:00 +08:00 |
|
hiyouga
|
f25b8626bf
|
support pissa
Former-commit-id: 8c1046d78a
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
c0c6b8075a
|
tiny fix
Former-commit-id: 38b6b0f52e
|
2024-06-16 01:06:41 +08:00 |
|
hiyouga
|
96b82ccd4d
|
use fixture
Former-commit-id: 80a9e6bf94
|
2024-06-15 20:06:17 +08:00 |
|
hiyouga
|
2946153cea
|
add license
Former-commit-id: d87108daa6
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
fcbfa70c19
|
disable DP
Former-commit-id: d519b4d76d
|
2024-06-15 04:57:19 +08:00 |
|
hiyouga
|
a3f4925c2c
|
add test cases
Former-commit-id: b27269bd2b
|
2024-06-15 04:05:54 +08:00 |
|
hiyouga
|
99ce085415
|
fix lint
Former-commit-id: 713fde4259
|
2024-06-13 00:48:44 +08:00 |
|
hiyouga
|
5834651c4a
|
fix #4198
Former-commit-id: 89f2bd8c8c
|
2024-06-11 15:38:38 +08:00 |
|
hiyouga
|
53de7f7cc3
|
tiny fix
Former-commit-id: 90e14a960d
|
2024-06-11 12:48:53 +08:00 |
|
hiyouga
|
4f0ce9be4e
|
reorganize adapter code
Former-commit-id: 54cd743ebf
|
2024-06-08 00:47:23 +08:00 |
|
hiyouga
|
cceff9f520
|
lora modules: all by default
Former-commit-id: cae4737907
|
2024-06-06 03:53:28 +08:00 |
|
hoshi-hiyouga
|
d31c9c73c7
|
Merge pull request #4080 from MengqingCao/npu
Add npu option for model exporting
Former-commit-id: ca459f67eb
|
2024-06-06 03:15:44 +08:00 |
|
hoshi-hiyouga
|
d9a372658a
|
Update model_args.py
Former-commit-id: af2c3cbee4
|
2024-06-06 03:14:23 +08:00 |
|
hiyouga
|
c439c959f7
|
add vllm_dtype arg #3387 #3717
Former-commit-id: 8fcc79e1e6
|
2024-06-06 02:53:27 +08:00 |
|
hiyouga
|
3fcb678d00
|
support train from scratch #4033 #4075
Former-commit-id: a12a506c3d
|
2024-06-06 02:43:19 +08:00 |
|
MengqingCao
|
15f6ab73a5
|
add npu for model export
Former-commit-id: 07045c876a
|
2024-06-05 07:06:40 +00:00 |
|
hiyouga
|
e4ce59243b
|
fix #4005 #4013
Former-commit-id: eed33862bc
|
2024-06-03 19:12:29 +08:00 |
|
hoshi-hiyouga
|
eaab09fccb
|
Merge pull request #4007 from xu-song/patch-3
Update model_args.py
Former-commit-id: 1539c72b94
|
2024-06-03 18:54:37 +08:00 |
|
hiyouga
|
d0ceb1b091
|
fix #4022
Former-commit-id: 24e1c0e2ee
|
2024-06-03 18:38:36 +08:00 |
|
Xu Song
|
abe33220bf
|
Update model_args.py
Former-commit-id: dade2f083d
|
2024-05-31 14:35:48 +08:00 |
|
hiyouga
|
820404946e
|
better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui
Former-commit-id: 8070871732
|
2024-05-29 23:55:38 +08:00 |
|
hiyouga
|
87e71df597
|
bump vllm version to 0.4.1
Former-commit-id: 1e80a3a638
|
2024-05-28 21:27:27 +08:00 |
|
hiyouga
|
3ea8f5e6b9
|
support DDP in webui
Former-commit-id: 7c016b22aa
|
2024-05-28 19:24:22 +08:00 |
|
hiyouga
|
b88ecd71fd
|
fix full/freeze tuning for mllm
Former-commit-id: 08564838bd
|
2024-05-27 20:37:57 +08:00 |
|
BUAADreamer
|
a6c2a2071d
|
Merge branch 'hiyouga:main' into main
Former-commit-id: 4bc7c10c00
|
2024-05-27 11:54:01 +08:00 |
|
hiyouga
|
4807c11db8
|
support SimPO #3900
Former-commit-id: cb63b32986
|
2024-05-26 23:46:33 +08:00 |
|