zhuHQ
|
d9189f9f0b
|
[optim] add support to APOLLO (#6617)
|
2025-01-15 00:24:56 +08:00 |
|
hoshi-hiyouga
|
1c7663d304
|
pin vllm version to 0.6.5 (#6629)
|
2025-01-14 02:44:02 +08:00 |
|
hoshi-hiyouga
|
6b34b69fa6
|
Merge pull request #6564 from stephen-nju/fix_ray
Fix ray
|
2025-01-08 18:14:18 +08:00 |
|
zhubin
|
9c4c84828b
|
fix get ray args when args not a dict
|
2025-01-08 10:06:02 +00:00 |
|
hiyouga
|
47e17dd689
|
improve log
|
2025-01-08 09:56:10 +00:00 |
|
hiyouga
|
d8cac6f546
|
refactor ray integration, support save ckpt
|
2025-01-07 09:39:10 +00:00 |
|
Eric Tang
|
1e8e7be0a5
|
run style check
|
2025-01-07 08:55:44 +00:00 |
|
Kourosh Hakhamaneshi
|
163ddb680b
|
drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
|
2025-01-07 08:55:44 +00:00 |
|
hiyouga
|
6f5bb3b8e5
|
fix #6482
|
2024-12-30 06:03:07 +00:00 |
|
hiyouga
|
1324d158f9
|
support batch infer in vllm
|
2024-12-04 13:50:00 +00:00 |
|
hiyouga
|
58ab4579dc
|
add vllm config
|
2024-11-10 21:28:18 +08:00 |
|
hiyouga
|
c38aa29336
|
support rank0 logger
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
21db8ed2f4
|
use pre-commit
|
2024-10-29 09:07:46 +00:00 |
|
hiyouga
|
3af57795dd
|
tiny fix
|
2024-10-11 23:51:54 +08:00 |
|
Johnny
|
e5849cdcce
|
Update parser.py
|
2024-10-11 12:29:33 +02:00 |
|
hoshi-hiyouga
|
1ce0b42b1e
|
Update parser.py
|
2024-10-07 16:27:23 +08:00 |
|
Johnny
|
4e638777eb
|
Update parser.py
|
2024-10-07 10:17:45 +02:00 |
|
Johnny
|
6c1aef5560
|
Update parser.py
|
2024-10-06 20:34:19 +02:00 |
|
hiyouga
|
a45f3f5461
|
fix #5611
|
2024-10-06 10:34:55 +08:00 |
|
hiyouga
|
b6681d7198
|
support vllm 0.6.0
|
2024-09-08 02:26:20 +08:00 |
|
hiyouga
|
a025c3df61
|
remove visual_inputs, fix qlora
|
2024-08-31 00:24:51 +08:00 |
|
hiyouga
|
3382317e32
|
refactor mm training
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
a7dd7d325e
|
update liger kernel
|
2024-08-29 20:46:08 +08:00 |
|
hiyouga
|
aa1afdc756
|
fix #5292
|
2024-08-29 20:37:47 +08:00 |
|
hiyouga
|
72bc8f0111
|
support liger kernel
|
2024-08-27 11:20:14 +08:00 |
|
hiyouga
|
e2a28f51c6
|
add adam_mini to readme
|
2024-08-09 20:02:03 +08:00 |
|
hiyouga
|
c87023d539
|
follow #5115
|
2024-08-09 18:03:00 +08:00 |
|
hiyouga
|
8f6995081c
|
update parser
|
2024-07-19 01:36:39 +08:00 |
|
hiyouga
|
779aae83d2
|
follow #4878 fix #4684
|
2024-07-18 22:06:12 +08:00 |
|
Shiyu Zhang
|
1e7b396ff2
|
train only on the last conversation turn
|
2024-07-18 15:30:25 +08:00 |
|
hiyouga
|
d774b94f12
|
support batch_eval_metrics, fix #4826
|
2024-07-17 00:33:00 +08:00 |
|
hiyouga
|
99ab7a8c1c
|
allow computing rouge in training
|
2024-07-15 01:16:26 +08:00 |
|
hiyouga
|
29ebcd75d5
|
fix up
|
2024-07-15 01:04:56 +08:00 |
|
hiyouga
|
88a20ba797
|
fix #4699
slow tokenizer for yi models
|
2024-07-14 15:34:22 +08:00 |
|
hiyouga
|
6fd6aa4530
|
fix packing for eager/sdpa attn
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
575a02a23d
|
update hparams
|
2024-07-03 23:18:58 +08:00 |
|
ancv
|
e8e13b0942
|
move efficient_packing from data_args to model_args
|
2024-07-02 18:37:55 +07:00 |
|
hiyouga
|
8baf3b22b0
|
refactor pissa, improve llamaboard
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
8ed6b367e2
|
fix #4549
|
2024-06-28 00:41:58 +08:00 |
|
hiyouga
|
e44a4f07f0
|
tiny fix
|
2024-06-27 20:14:48 +08:00 |
|
hiyouga
|
555ca8d780
|
lint
|
2024-06-25 02:55:50 +08:00 |
|
hiyouga
|
095fab58d3
|
tiny fix about badam
|
2024-06-25 01:54:53 +08:00 |
|
hoshi-hiyouga
|
d0f953bf5b
|
Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
|
2024-06-25 01:49:13 +08:00 |
|
hiyouga
|
8d4f5093cf
|
tiny fix
|
2024-06-20 22:56:05 +08:00 |
|
Jonery
|
5c2ff1b749
|
Cleaner integration.
|
2024-06-19 12:29:40 +08:00 |
|
hiyouga
|
4bd77d8563
|
fix #4357
|
2024-06-18 22:42:45 +08:00 |
|
Jonery
|
8f7c78b641
|
fix typo
|
2024-06-18 12:39:26 +08:00 |
|
Jonery
|
0f72aac8c9
|
Support distributed BAdam.
|
2024-06-18 12:27:47 +08:00 |
|
Jonery
|
ea1f3ba5e0
|
Merge remote-tracking branch 'upstream/main'
|
2024-06-17 18:44:51 +08:00 |
|
Jonery
|
33b4372778
|
adapt for badam with ds zero3
|
2024-06-17 18:18:10 +08:00 |
|