Commit Graph

33 Commits

Author SHA1 Message Date
hiyouga
779aae83d2 follow #4878 fix #4684 2024-07-18 22:06:12 +08:00
Shiyu Zhang
1e7b396ff2 仅仅训练最后一轮对话 2024-07-18 15:30:25 +08:00
hiyouga
d774b94f12 support batch_eval_metrics, fix #4826 2024-07-17 00:33:00 +08:00
hiyouga
99ab7a8c1c allow computing rouge in training 2024-07-15 01:16:26 +08:00
hiyouga
29ebcd75d5 fix up 2024-07-15 01:04:56 +08:00
hiyouga
88a20ba797 fix #4699
slow tokenizer for yi models
2024-07-14 15:34:22 +08:00
hiyouga
6fd6aa4530 fix packing for eager/sdpa attn 2024-07-04 01:52:43 +08:00
hiyouga
575a02a23d update hparams 2024-07-03 23:18:58 +08:00
ancv
e8e13b0942 move efficient_packing from data_args to model_args 2024-07-02 18:37:55 +07:00
hiyouga
8baf3b22b0 refactor pissa, improve llamaboard 2024-06-28 01:04:24 +08:00
hiyouga
8ed6b367e2 fix #4549 2024-06-28 00:41:58 +08:00
hiyouga
e44a4f07f0 tiny fix 2024-06-27 20:14:48 +08:00
hiyouga
555ca8d780 lint 2024-06-25 02:55:50 +08:00
hiyouga
095fab58d3 tiny fix about badam 2024-06-25 01:54:53 +08:00
hoshi-hiyouga
d0f953bf5b Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
2024-06-25 01:49:13 +08:00
hiyouga
8d4f5093cf tiny fix 2024-06-20 22:56:05 +08:00
Jonery
5c2ff1b749 Cleaner integration. 2024-06-19 12:29:40 +08:00
hiyouga
4bd77d8563 fix #4357 2024-06-18 22:42:45 +08:00
Jonery
8f7c78b641 fix typo 2024-06-18 12:39:26 +08:00
Jonery
0f72aac8c9 Support distributed BAdam. 2024-06-18 12:27:47 +08:00
Jonery
ea1f3ba5e0 Merge remote-tracking branch 'upstream/main' 2024-06-17 18:44:51 +08:00
Jonery
33b4372778 adapt for badam with ds zero3 2024-06-17 18:18:10 +08:00
hoshi-hiyouga
29c1f31baa Update parser.py 2024-06-16 02:57:00 +08:00
hiyouga
8c1046d78a support pissa 2024-06-16 01:08:12 +08:00
hiyouga
d87108daa6 add license 2024-06-15 17:54:33 +08:00
hiyouga
d519b4d76d disable DP 2024-06-15 04:57:19 +08:00
hiyouga
90e14a960d tiny fix 2024-06-11 12:48:53 +08:00
hiyouga
24e1c0e2ee fix #4022 2024-06-03 18:38:36 +08:00
hiyouga
8070871732 better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui
2024-05-29 23:55:38 +08:00
hiyouga
1e80a3a638 bump vllm version to 0.4.1 2024-05-28 21:27:27 +08:00
hiyouga
7c016b22aa support DDP in webui 2024-05-28 19:24:22 +08:00
hiyouga
67ebc7b388 fix oom issues in export 2024-05-23 23:32:45 +08:00
hiyouga
308edbc426 rename package 2024-05-16 18:39:08 +08:00