Commit Graph

61 Commits

Author SHA1 Message Date
zhuHQ
d9189f9f0b [optim] add support to APOLLO (#6617) 2025-01-15 00:24:56 +08:00
hoshi-hiyouga
1c7663d304 pin vllm version to 0.6.5 (#6629) 2025-01-14 02:44:02 +08:00
hoshi-hiyouga
6b34b69fa6 Merge pull request #6564 from stephen-nju/fix_ray
Fix ray
2025-01-08 18:14:18 +08:00
zhubin
9c4c84828b fix get ray args when args not a dict 2025-01-08 10:06:02 +00:00
hiyouga
47e17dd689 improve log 2025-01-08 09:56:10 +00:00
hiyouga
d8cac6f546 refactor ray integration, support save ckpt 2025-01-07 09:39:10 +00:00
Eric Tang
1e8e7be0a5 run style check 2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
163ddb680b drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
2025-01-07 08:55:44 +00:00
hiyouga
6f5bb3b8e5 fix #6482 2024-12-30 06:03:07 +00:00
hiyouga
1324d158f9 support batch infer in vllm 2024-12-04 13:50:00 +00:00
hiyouga
58ab4579dc add vllm config 2024-11-10 21:28:18 +08:00
hiyouga
c38aa29336 support rank0 logger 2024-11-02 18:31:04 +08:00
hiyouga
21db8ed2f4 use pre-commit 2024-10-29 09:07:46 +00:00
hiyouga
3af57795dd tiny fix 2024-10-11 23:51:54 +08:00
Johnny
e5849cdcce Update parser.py 2024-10-11 12:29:33 +02:00
hoshi-hiyouga
1ce0b42b1e Update parser.py 2024-10-07 16:27:23 +08:00
Johnny
4e638777eb Update parser.py 2024-10-07 10:17:45 +02:00
Johnny
6c1aef5560 Update parser.py 2024-10-06 20:34:19 +02:00
hiyouga
a45f3f5461 fix #5611 2024-10-06 10:34:55 +08:00
hiyouga
b6681d7198 support vllm 0.6.0 2024-09-08 02:26:20 +08:00
hiyouga
a025c3df61 remove visual_inputs, fix qlora 2024-08-31 00:24:51 +08:00
hiyouga
3382317e32 refactor mm training 2024-08-30 02:14:31 +08:00
hiyouga
a7dd7d325e update liger kernel 2024-08-29 20:46:08 +08:00
hiyouga
aa1afdc756 fix #5292 2024-08-29 20:37:47 +08:00
hiyouga
72bc8f0111 support liger kernel 2024-08-27 11:20:14 +08:00
hiyouga
e2a28f51c6 add adam_mini to readme 2024-08-09 20:02:03 +08:00
hiyouga
c87023d539 follow #5115 2024-08-09 18:03:00 +08:00
hiyouga
8f6995081c update parser 2024-07-19 01:36:39 +08:00
hiyouga
779aae83d2 follow #4878 fix #4684 2024-07-18 22:06:12 +08:00
Shiyu Zhang
1e7b396ff2 train only on the last turn of the conversation 2024-07-18 15:30:25 +08:00
hiyouga
d774b94f12 support batch_eval_metrics, fix #4826 2024-07-17 00:33:00 +08:00
hiyouga
99ab7a8c1c allow computing rouge in training 2024-07-15 01:16:26 +08:00
hiyouga
29ebcd75d5 fix up 2024-07-15 01:04:56 +08:00
hiyouga
88a20ba797 fix #4699
slow tokenizer for yi models
2024-07-14 15:34:22 +08:00
hiyouga
6fd6aa4530 fix packing for eager/sdpa attn 2024-07-04 01:52:43 +08:00
hiyouga
575a02a23d update hparams 2024-07-03 23:18:58 +08:00
ancv
e8e13b0942 move efficient_packing from data_args to model_args 2024-07-02 18:37:55 +07:00
hiyouga
8baf3b22b0 refactor pissa, improve llamaboard 2024-06-28 01:04:24 +08:00
hiyouga
8ed6b367e2 fix #4549 2024-06-28 00:41:58 +08:00
hiyouga
e44a4f07f0 tiny fix 2024-06-27 20:14:48 +08:00
hiyouga
555ca8d780 lint 2024-06-25 02:55:50 +08:00
hiyouga
095fab58d3 tiny fix about badam 2024-06-25 01:54:53 +08:00
hoshi-hiyouga
d0f953bf5b Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
2024-06-25 01:49:13 +08:00
hiyouga
8d4f5093cf tiny fix 2024-06-20 22:56:05 +08:00
Jonery
5c2ff1b749 Cleaner integration. 2024-06-19 12:29:40 +08:00
hiyouga
4bd77d8563 fix #4357 2024-06-18 22:42:45 +08:00
Jonery
8f7c78b641 fix typo 2024-06-18 12:39:26 +08:00
Jonery
0f72aac8c9 Support distributed BAdam. 2024-06-18 12:27:47 +08:00
Jonery
ea1f3ba5e0 Merge remote-tracking branch 'upstream/main' 2024-06-17 18:44:51 +08:00
Jonery
33b4372778 adapt for badam with ds zero3 2024-06-17 18:18:10 +08:00