hiyouga
|
da542fad18
|
imporve log
Former-commit-id: 47e17dd689840ca9b3c5f34448e5f80265336cca
|
2025-01-08 09:56:10 +00:00 |
|
hiyouga
|
b4174021d6
|
refactor ray integration, support save ckpt
Former-commit-id: d8cac6f54663e6cffeddf2c65e3da454e7b86a75
|
2025-01-07 09:39:10 +00:00 |
|
Eric Tang
|
bba52e258e
|
run style check
Former-commit-id: 1e8e7be0a535e55888f58bbe2c38bc1c382e9012
|
2025-01-07 08:55:44 +00:00 |
|
Kourosh Hakhamaneshi
|
1217240918
|
drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
Former-commit-id: 163ddb680b6f84a4424a887a3b8a5d668044e87c
|
2025-01-07 08:55:44 +00:00 |
|
hiyouga
|
813f5919a3
|
fix #6482
Former-commit-id: 6f5bb3b8e5b6eb7fdfd7b0ca8eba789ab741a7b6
|
2024-12-30 06:03:07 +00:00 |
|
hiyouga
|
235cdcacee
|
support batch infer in vllm
Former-commit-id: 1324d158f954d777f1fbf09f46149c372704b388
|
2024-12-04 13:50:00 +00:00 |
|
hiyouga
|
0d18cca0db
|
add vllm config
Former-commit-id: 58ab4579dc81a1dcea2bf5938ba3f3116cecfc76
|
2024-11-10 21:28:18 +08:00 |
|
hiyouga
|
e83cb17f97
|
support rank0 logger
Former-commit-id: c38aa29336f286266553da4909a7267d7ef21f37
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
0d8aa6e6ef
|
use pre-commit
Former-commit-id: 21db8ed2f4a0eba203754a92ce0741538e8ee709
|
2024-10-29 09:07:46 +00:00 |
|
hiyouga
|
e90a1199da
|
tiny fix
Former-commit-id: 3af57795dda5d236200bad4aa3f2e29ae8930fe2
|
2024-10-11 23:51:54 +08:00 |
|
Johnny
|
27be1e2122
|
Update parser.py
Former-commit-id: e5849cdcce109e15547edcf9a692e7c13d625e5a
|
2024-10-11 12:29:33 +02:00 |
|
hoshi-hiyouga
|
b855d3421e
|
Update parser.py
Former-commit-id: 1ce0b42b1e30cb5419c91702a499f23d52db43ee
|
2024-10-07 16:27:23 +08:00 |
|
Johnny
|
059c2ffbea
|
Update parser.py
Former-commit-id: 4e638777ebcbf7dea22011361fb341bafe6ba9d9
|
2024-10-07 10:17:45 +02:00 |
|
Johnny
|
9a6045eee6
|
Update parser.py
Former-commit-id: 6c1aef55604649a956fe928d89280626923815b8
|
2024-10-06 20:34:19 +02:00 |
|
hiyouga
|
56132983cf
|
fix #5611
Former-commit-id: a45f3f5461e2936b9e119eda2ef4d8c7a4131740
|
2024-10-06 10:34:55 +08:00 |
|
hiyouga
|
78cf256067
|
support vllm 0.6.0
Former-commit-id: b6681d7198acf4acbebfe271dd22095e236bc430
|
2024-09-08 02:26:20 +08:00 |
|
hiyouga
|
f31e7e0dfc
|
remove visual_inputs, fix qlora
Former-commit-id: a025c3df61db154bef13033518903bbf846f4fc8
|
2024-08-31 00:24:51 +08:00 |
|
hiyouga
|
a83756b5e9
|
refactor mm training
Former-commit-id: 3382317e32f88ed377d3e7759bdeaf0f2559d22a
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
0e4ee9d9a3
|
update liger kernel
Former-commit-id: a7dd7d325e68c92c7470c1e9ef83a7c8abcbc616
|
2024-08-29 20:46:08 +08:00 |
|
hiyouga
|
f153ee13be
|
fix #5292
Former-commit-id: aa1afdc75614868172bd2f9c052647b8f226d3f2
|
2024-08-29 20:37:47 +08:00 |
|
hiyouga
|
c765292093
|
support liger kernel
Former-commit-id: 72bc8f01111ad69b92a647b54b4af988515d9c34
|
2024-08-27 11:20:14 +08:00 |
|
hiyouga
|
5eacd17090
|
add adam_mini to readme
Former-commit-id: e2a28f51c635d64ff9de65a37087d89356bdedcc
|
2024-08-09 20:02:03 +08:00 |
|
hiyouga
|
b5146facff
|
follow #5115
Former-commit-id: c87023d539875cd8e622d40212a5627c9c182fb8
|
2024-08-09 18:03:00 +08:00 |
|
hiyouga
|
542658c986
|
update parser
Former-commit-id: 8f6995081cbdbb2424da586a443e5220a8990faa
|
2024-07-19 01:36:39 +08:00 |
|
hiyouga
|
34f16cc635
|
follow #4878 fix #4684
Former-commit-id: 779aae83d253de0a86201ff87543b5d695e28d23
|
2024-07-18 22:06:12 +08:00 |
|
Shiyu Zhang
|
249adacc4d
|
仅仅训练最后一轮对话
Former-commit-id: 1e7b396ff2489055574fd3365425d26360d73897
|
2024-07-18 15:30:25 +08:00 |
|
hiyouga
|
e90fae61f4
|
support batch_eval_metrics, fix #4826
Former-commit-id: d774b94f124923829b2eae428e25199d503ebfcb
|
2024-07-17 00:33:00 +08:00 |
|
hiyouga
|
22859b8734
|
allow computing rouge in training
Former-commit-id: 99ab7a8c1c966232faa11b6a42b9740d9a20ace3
|
2024-07-15 01:16:26 +08:00 |
|
hiyouga
|
14bc7b0551
|
fix up
Former-commit-id: 29ebcd75d55f70f2891632eba187b643cc3a9e51
|
2024-07-15 01:04:56 +08:00 |
|
hiyouga
|
dfd2d912cd
|
fix #4699
slow tokenizer for yi models
Former-commit-id: 88a20ba7972c533d650967a118d612471fe2b2e8
|
2024-07-14 15:34:22 +08:00 |
|
hiyouga
|
7b3c1f29ff
|
fix packing for eager/sdpa attn
Former-commit-id: 6fd6aa4530f81a2ed306eeb2a5167607288b62c6
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
ff6fc666c1
|
update hparams
Former-commit-id: 575a02a23d9b41d00ca6291d8a40b5bdb3cbeeec
|
2024-07-03 23:18:58 +08:00 |
|
ancv
|
7f42932957
|
move efficient_packing from data_args to model_args
Former-commit-id: e8e13b09423dd08a31a3bde8f85833c6e5d43ee5
|
2024-07-02 18:37:55 +07:00 |
|
hiyouga
|
835f0578c2
|
refactor pissa, improve llamaboard
Former-commit-id: 8baf3b22b0fb9624807d809832f097301982d192
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
a294ef2fae
|
fix #4549
Former-commit-id: 8ed6b367e26490acab5d2d7b32f0d5dad449d26a
|
2024-06-28 00:41:58 +08:00 |
|
hiyouga
|
7c488cea57
|
tiny fix
Former-commit-id: e44a4f07f09bbee55c10ccee91dd858256c36054
|
2024-06-27 20:14:48 +08:00 |
|
hiyouga
|
f3f25ae3b7
|
lint
Former-commit-id: 555ca8d780a1fbaf42e73450f5eb33048329d921
|
2024-06-25 02:55:50 +08:00 |
|
hiyouga
|
a225b5a70c
|
tiny fix about badam
Former-commit-id: 095fab58d3692607c9e78747b4218ae1abcf5aaf
|
2024-06-25 01:54:53 +08:00 |
|
hoshi-hiyouga
|
fe6ef6400c
|
Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
Former-commit-id: d0f953bf5bdbfd49acc82ff055bd54889241761a
|
2024-06-25 01:49:13 +08:00 |
|
hiyouga
|
0844750bb9
|
tiny fix
Former-commit-id: 8d4f5093cfcccfe9df173b4c4f7ec0125aecf198
|
2024-06-20 22:56:05 +08:00 |
|
Jonery
|
c779899f7b
|
Cleaner integration.
Former-commit-id: 5c2ff1b749a265dd3c979189ec491d8ac911a6f6
|
2024-06-19 12:29:40 +08:00 |
|
hiyouga
|
5156114981
|
fix #4357
Former-commit-id: 4bd77d8563aa85230af65caf901214247e214bed
|
2024-06-18 22:42:45 +08:00 |
|
Jonery
|
c2734108e7
|
fix typo
Former-commit-id: 8f7c78b64138602406af748b0e15948ebbd2dcb5
|
2024-06-18 12:39:26 +08:00 |
|
Jonery
|
3a5eacb4cf
|
Support distributed BAdam.
Former-commit-id: 0f72aac8c9227e33ad20d2b1641b1c9faae16a5f
|
2024-06-18 12:27:47 +08:00 |
|
Jonery
|
5d59f6562a
|
Merge remote-tracking branch 'upstream/main'
Former-commit-id: ea1f3ba5e030504e07053484f50f4cbdb37808bc
|
2024-06-17 18:44:51 +08:00 |
|
Jonery
|
756566342d
|
adapt for badam with ds zero3
Former-commit-id: 33b437277846d4f0b64c13a0bc892ef4f345a21e
|
2024-06-17 18:18:10 +08:00 |
|
hoshi-hiyouga
|
06bbc29614
|
Update parser.py
Former-commit-id: 29c1f31baa442e35714b18b7e51896274a828cae
|
2024-06-16 02:57:00 +08:00 |
|
hiyouga
|
f25b8626bf
|
support pissa
Former-commit-id: 8c1046d78ac6c8f9429b73617e35e1eccb35138f
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
2946153cea
|
add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
fcbfa70c19
|
disable DP
Former-commit-id: d519b4d76d39b21a21b1d2f6f7ce6b3af9525d03
|
2024-06-15 04:57:19 +08:00 |
|