hiyouga
|
3af57795dd
|
tiny fix
|
2024-10-11 23:51:54 +08:00 |
|
Johnny
|
e5849cdcce
|
Update parser.py
|
2024-10-11 12:29:33 +02:00 |
|
hoshi-hiyouga
|
1ce0b42b1e
|
Update parser.py
|
2024-10-07 16:27:23 +08:00 |
|
Johnny
|
4e638777eb
|
Update parser.py
|
2024-10-07 10:17:45 +02:00 |
|
Johnny
|
6c1aef5560
|
Update parser.py
|
2024-10-06 20:34:19 +02:00 |
|
hiyouga
|
a45f3f5461
|
fix #5611
|
2024-10-06 10:34:55 +08:00 |
|
hiyouga
|
b6681d7198
|
support vllm 0.6.0
|
2024-09-08 02:26:20 +08:00 |
|
hiyouga
|
a025c3df61
|
remove visual_inputs, fix qlora
|
2024-08-31 00:24:51 +08:00 |
|
hiyouga
|
3382317e32
|
refactor mm training
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
a7dd7d325e
|
update liger kernel
|
2024-08-29 20:46:08 +08:00 |
|
hiyouga
|
aa1afdc756
|
fix #5292
|
2024-08-29 20:37:47 +08:00 |
|
hiyouga
|
72bc8f0111
|
support liger kernel
|
2024-08-27 11:20:14 +08:00 |
|
hiyouga
|
e2a28f51c6
|
add adam_mini to readme
|
2024-08-09 20:02:03 +08:00 |
|
hiyouga
|
c87023d539
|
follow #5115
|
2024-08-09 18:03:00 +08:00 |
|
hiyouga
|
8f6995081c
|
update parser
|
2024-07-19 01:36:39 +08:00 |
|
hiyouga
|
779aae83d2
|
follow #4878 fix #4684
|
2024-07-18 22:06:12 +08:00 |
|
Shiyu Zhang
|
1e7b396ff2
|
train only on the last turn of the conversation
|
2024-07-18 15:30:25 +08:00 |
|
hiyouga
|
d774b94f12
|
support batch_eval_metrics, fix #4826
|
2024-07-17 00:33:00 +08:00 |
|
hiyouga
|
99ab7a8c1c
|
allow computing rouge in training
|
2024-07-15 01:16:26 +08:00 |
|
hiyouga
|
29ebcd75d5
|
fix up
|
2024-07-15 01:04:56 +08:00 |
|
hiyouga
|
88a20ba797
|
fix #4699
slow tokenizer for yi models
|
2024-07-14 15:34:22 +08:00 |
|
hiyouga
|
6fd6aa4530
|
fix packing for eager/sdpa attn
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
575a02a23d
|
update hparams
|
2024-07-03 23:18:58 +08:00 |
|
ancv
|
e8e13b0942
|
move efficient_packing from data_args to model_args
|
2024-07-02 18:37:55 +07:00 |
|
hiyouga
|
8baf3b22b0
|
refactor pissa, improve llamaboard
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
8ed6b367e2
|
fix #4549
|
2024-06-28 00:41:58 +08:00 |
|
hiyouga
|
e44a4f07f0
|
tiny fix
|
2024-06-27 20:14:48 +08:00 |
|
hiyouga
|
555ca8d780
|
lint
|
2024-06-25 02:55:50 +08:00 |
|
hiyouga
|
095fab58d3
|
tiny fix about badam
|
2024-06-25 01:54:53 +08:00 |
|
hoshi-hiyouga
|
d0f953bf5b
|
Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
|
2024-06-25 01:49:13 +08:00 |
|
hiyouga
|
8d4f5093cf
|
tiny fix
|
2024-06-20 22:56:05 +08:00 |
|
Jonery
|
5c2ff1b749
|
Cleaner integration.
|
2024-06-19 12:29:40 +08:00 |
|
hiyouga
|
4bd77d8563
|
fix #4357
|
2024-06-18 22:42:45 +08:00 |
|
Jonery
|
8f7c78b641
|
fix typo
|
2024-06-18 12:39:26 +08:00 |
|
Jonery
|
0f72aac8c9
|
Support distributed BAdam.
|
2024-06-18 12:27:47 +08:00 |
|
Jonery
|
ea1f3ba5e0
|
Merge remote-tracking branch 'upstream/main'
|
2024-06-17 18:44:51 +08:00 |
|
Jonery
|
33b4372778
|
adapt for badam with ds zero3
|
2024-06-17 18:18:10 +08:00 |
|
hoshi-hiyouga
|
29c1f31baa
|
Update parser.py
|
2024-06-16 02:57:00 +08:00 |
|
hiyouga
|
8c1046d78a
|
support pissa
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
d87108daa6
|
add license
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
d519b4d76d
|
disable DP
|
2024-06-15 04:57:19 +08:00 |
|
hiyouga
|
90e14a960d
|
tiny fix
|
2024-06-11 12:48:53 +08:00 |
|
hiyouga
|
24e1c0e2ee
|
fix #4022
|
2024-06-03 18:38:36 +08:00 |
|
hiyouga
|
8070871732
|
better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui
|
2024-05-29 23:55:38 +08:00 |
|
hiyouga
|
1e80a3a638
|
bump vllm version to 0.4.1
|
2024-05-28 21:27:27 +08:00 |
|
hiyouga
|
7c016b22aa
|
support DDP in webui
|
2024-05-28 19:24:22 +08:00 |
|
hiyouga
|
67ebc7b388
|
fix oom issues in export
|
2024-05-23 23:32:45 +08:00 |
|
hiyouga
|
308edbc426
|
rename package
|
2024-05-16 18:39:08 +08:00 |
|