hiyouga
|
779aae83d2
|
follow #4878 fix #4684
|
2024-07-18 22:06:12 +08:00 |
|
Shiyu Zhang
|
1e7b396ff2
|
仅仅训练最后一轮对话
|
2024-07-18 15:30:25 +08:00 |
|
hiyouga
|
d774b94f12
|
support batch_eval_metrics, fix #4826
|
2024-07-17 00:33:00 +08:00 |
|
hiyouga
|
99ab7a8c1c
|
allow computing rouge in training
|
2024-07-15 01:16:26 +08:00 |
|
hiyouga
|
29ebcd75d5
|
fix up
|
2024-07-15 01:04:56 +08:00 |
|
hiyouga
|
88a20ba797
|
fix #4699
slow tokenizer for yi models
|
2024-07-14 15:34:22 +08:00 |
|
hiyouga
|
6fd6aa4530
|
fix packing for eager/sdpa attn
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
575a02a23d
|
update hparams
|
2024-07-03 23:18:58 +08:00 |
|
ancv
|
e8e13b0942
|
move efficient_packing from data_args to model_args
|
2024-07-02 18:37:55 +07:00 |
|
hiyouga
|
8baf3b22b0
|
refactor pissa, improve llamaboard
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
8ed6b367e2
|
fix #4549
|
2024-06-28 00:41:58 +08:00 |
|
hiyouga
|
e44a4f07f0
|
tiny fix
|
2024-06-27 20:14:48 +08:00 |
|
hiyouga
|
555ca8d780
|
lint
|
2024-06-25 02:55:50 +08:00 |
|
hiyouga
|
095fab58d3
|
tiny fix about badam
|
2024-06-25 01:54:53 +08:00 |
|
hoshi-hiyouga
|
d0f953bf5b
|
Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
|
2024-06-25 01:49:13 +08:00 |
|
hiyouga
|
8d4f5093cf
|
tiny fix
|
2024-06-20 22:56:05 +08:00 |
|
Jonery
|
5c2ff1b749
|
Cleaner integration.
|
2024-06-19 12:29:40 +08:00 |
|
hiyouga
|
4bd77d8563
|
fix #4357
|
2024-06-18 22:42:45 +08:00 |
|
Jonery
|
8f7c78b641
|
fix typo
|
2024-06-18 12:39:26 +08:00 |
|
Jonery
|
0f72aac8c9
|
Support distributed BAdam.
|
2024-06-18 12:27:47 +08:00 |
|
Jonery
|
ea1f3ba5e0
|
Merge remote-tracking branch 'upstream/main'
|
2024-06-17 18:44:51 +08:00 |
|
Jonery
|
33b4372778
|
adapt for badam with ds zero3
|
2024-06-17 18:18:10 +08:00 |
|
hoshi-hiyouga
|
29c1f31baa
|
Update parser.py
|
2024-06-16 02:57:00 +08:00 |
|
hiyouga
|
8c1046d78a
|
support pissa
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
d87108daa6
|
add license
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
d519b4d76d
|
disable DP
|
2024-06-15 04:57:19 +08:00 |
|
hiyouga
|
90e14a960d
|
tiny fix
|
2024-06-11 12:48:53 +08:00 |
|
hiyouga
|
24e1c0e2ee
|
fix #4022
|
2024-06-03 18:38:36 +08:00 |
|
hiyouga
|
8070871732
|
better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui
|
2024-05-29 23:55:38 +08:00 |
|
hiyouga
|
1e80a3a638
|
bump vllm version to 0.4.1
|
2024-05-28 21:27:27 +08:00 |
|
hiyouga
|
7c016b22aa
|
support DDP in webui
|
2024-05-28 19:24:22 +08:00 |
|
hiyouga
|
67ebc7b388
|
fix oom issues in export
|
2024-05-23 23:32:45 +08:00 |
|
hiyouga
|
308edbc426
|
rename package
|
2024-05-16 18:39:08 +08:00 |
|