Commit Graph

48 Commits

Author SHA1 Message Date
hiyouga
3af57795dd tiny fix 2024-10-11 23:51:54 +08:00
Johnny
e5849cdcce Update parser.py 2024-10-11 12:29:33 +02:00
hoshi-hiyouga
1ce0b42b1e Update parser.py 2024-10-07 16:27:23 +08:00
Johnny
4e638777eb Update parser.py 2024-10-07 10:17:45 +02:00
Johnny
6c1aef5560 Update parser.py 2024-10-06 20:34:19 +02:00
hiyouga
a45f3f5461 fix #5611 2024-10-06 10:34:55 +08:00
hiyouga
b6681d7198 support vllm 0.6.0 2024-09-08 02:26:20 +08:00
hiyouga
a025c3df61 remove visual_inputs, fix qlora 2024-08-31 00:24:51 +08:00
hiyouga
3382317e32 refactor mm training 2024-08-30 02:14:31 +08:00
hiyouga
a7dd7d325e update liger kernel 2024-08-29 20:46:08 +08:00
hiyouga
aa1afdc756 fix #5292 2024-08-29 20:37:47 +08:00
hiyouga
72bc8f0111 support liger kernel 2024-08-27 11:20:14 +08:00
hiyouga
e2a28f51c6 add adam_mini to readme 2024-08-09 20:02:03 +08:00
hiyouga
c87023d539 follow #5115 2024-08-09 18:03:00 +08:00
hiyouga
8f6995081c update parser 2024-07-19 01:36:39 +08:00
hiyouga
779aae83d2 follow #4878 fix #4684 2024-07-18 22:06:12 +08:00
Shiyu Zhang
1e7b396ff2 仅仅训练最后一轮对话 2024-07-18 15:30:25 +08:00
hiyouga
d774b94f12 support batch_eval_metrics, fix #4826 2024-07-17 00:33:00 +08:00
hiyouga
99ab7a8c1c allow computing rouge in training 2024-07-15 01:16:26 +08:00
hiyouga
29ebcd75d5 fix up 2024-07-15 01:04:56 +08:00
hiyouga
88a20ba797 fix #4699
slow tokenizer for yi models
2024-07-14 15:34:22 +08:00
hiyouga
6fd6aa4530 fix packing for eager/sdpa attn 2024-07-04 01:52:43 +08:00
hiyouga
575a02a23d update hparams 2024-07-03 23:18:58 +08:00
ancv
e8e13b0942 move efficient_packing from data_args to model_args 2024-07-02 18:37:55 +07:00
hiyouga
8baf3b22b0 refactor pissa, improve llamaboard 2024-06-28 01:04:24 +08:00
hiyouga
8ed6b367e2 fix #4549 2024-06-28 00:41:58 +08:00
hiyouga
e44a4f07f0 tiny fix 2024-06-27 20:14:48 +08:00
hiyouga
555ca8d780 lint 2024-06-25 02:55:50 +08:00
hiyouga
095fab58d3 tiny fix about badam 2024-06-25 01:54:53 +08:00
hoshi-hiyouga
d0f953bf5b Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
2024-06-25 01:49:13 +08:00
hiyouga
8d4f5093cf tiny fix 2024-06-20 22:56:05 +08:00
Jonery
5c2ff1b749 Cleaner integration. 2024-06-19 12:29:40 +08:00
hiyouga
4bd77d8563 fix #4357 2024-06-18 22:42:45 +08:00
Jonery
8f7c78b641 fix typo 2024-06-18 12:39:26 +08:00
Jonery
0f72aac8c9 Support distributed BAdam. 2024-06-18 12:27:47 +08:00
Jonery
ea1f3ba5e0 Merge remote-tracking branch 'upstream/main' 2024-06-17 18:44:51 +08:00
Jonery
33b4372778 adapt for badam with ds zero3 2024-06-17 18:18:10 +08:00
hoshi-hiyouga
29c1f31baa Update parser.py 2024-06-16 02:57:00 +08:00
hiyouga
8c1046d78a support pissa 2024-06-16 01:08:12 +08:00
hiyouga
d87108daa6 add license 2024-06-15 17:54:33 +08:00
hiyouga
d519b4d76d disable DP 2024-06-15 04:57:19 +08:00
hiyouga
90e14a960d tiny fix 2024-06-11 12:48:53 +08:00
hiyouga
24e1c0e2ee fix #4022 2024-06-03 18:38:36 +08:00
hiyouga
8070871732 better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui
2024-05-29 23:55:38 +08:00
hiyouga
1e80a3a638 bump vllm version to 0.4.1 2024-05-28 21:27:27 +08:00
hiyouga
7c016b22aa support DDP in webui 2024-05-28 19:24:22 +08:00
hiyouga
67ebc7b388 fix oom issues in export 2024-05-23 23:32:45 +08:00
hiyouga
308edbc426 rename package 2024-05-16 18:39:08 +08:00