hoshi-hiyouga
|
012f4fef6b
|
Merge pull request #5665 from johnnynunez/main
vllm 0.6.3
Former-commit-id: 228dd1739e98a8ea0270c40edff1f30591c30059
|
2024-10-11 23:45:58 +08:00 |
|
Johnny
|
27be1e2122
|
Update parser.py
Former-commit-id: e5849cdcce109e15547edcf9a692e7c13d625e5a
|
2024-10-11 12:29:33 +02:00 |
|
huniu20
|
e8e98bb125
|
add om_hub_token argument
Former-commit-id: 7b91be33c9cd8473453716f0c4c6dec924304efc
|
2024-10-10 17:16:46 +08:00 |
|
hoshi-hiyouga
|
b855d3421e
|
Update parser.py
Former-commit-id: 1ce0b42b1e30cb5419c91702a499f23d52db43ee
|
2024-10-07 16:27:23 +08:00 |
|
Johnny
|
059c2ffbea
|
Update parser.py
Former-commit-id: 4e638777ebcbf7dea22011361fb341bafe6ba9d9
|
2024-10-07 10:17:45 +02:00 |
|
Johnny
|
9a6045eee6
|
Update parser.py
Former-commit-id: 6c1aef55604649a956fe928d89280626923815b8
|
2024-10-06 20:34:19 +02:00 |
|
hiyouga
|
56132983cf
|
fix #5611
Former-commit-id: a45f3f5461e2936b9e119eda2ef4d8c7a4131740
|
2024-10-06 10:34:55 +08:00 |
|
hiyouga
|
4df090ff48
|
fix #5542
Former-commit-id: fe7ffccdb9a45b31e20ab7e88282a75b45504a97
|
2024-09-30 23:28:55 +08:00 |
|
hiyouga
|
78cf256067
|
support vllm 0.6.0
Former-commit-id: b6681d7198acf4acbebfe271dd22095e236bc430
|
2024-09-08 02:26:20 +08:00 |
|
hiyouga
|
0daee7cb39
|
support activation offloading via unsloth gc
Former-commit-id: fb72a3adb0916232cc9ac9f0c725c02d07b9354c
|
2024-09-08 01:22:19 +08:00 |
|
hiyouga
|
3aa6a3e45b
|
add e2e tests
Former-commit-id: 94d5b1bd8f49dabeb9e3c53d634cfb3c06b0241d
|
2024-09-05 21:52:28 +08:00 |
|
hiyouga
|
9df7a26e6b
|
video datasets
Former-commit-id: 8cafc7b055a854f483ad1c67f3d487ffd34b5f89
|
2024-09-05 02:04:17 +08:00 |
|
hiyouga
|
d5ea05cfff
|
update get template
Former-commit-id: dabad5570bf4a6b1044c963d8f27717030f373ef
|
2024-09-04 22:36:20 +08:00 |
|
hoshi-hiyouga
|
1dfd1aaf82
|
Merge pull request #5323 from naem1023/feat/add-dataset-map-batch-size-argument
Add batch size of map function in the preprocessed dataset
Former-commit-id: 8f441c2b3a5bb84dec2c037a541084c0201726c6
|
2024-09-04 22:09:36 +08:00 |
|
hiyouga
|
22deca0e9e
|
lazy image load
Former-commit-id: 47ea97fb1ba77de2e8a561904aa8fdc27c3f5025
|
2024-09-04 02:27:08 +08:00 |
|
hiyouga
|
5ef58eb655
|
fix #5334
Former-commit-id: 59d2b31e968677263f005f57ae8a56fc758307a7
|
2024-09-03 19:09:42 +08:00 |
|
naem1023
|
46695e42cc
|
feat: add batch size of map function in the preprocessed dataset
Former-commit-id: 209313eeeab8d1a7c320bd9aa90a5f4656082b7c
|
2024-09-02 13:52:47 +09:00 |
|
hiyouga
|
bfdcc6bacf
|
add rlhf-v dataset
Former-commit-id: 8e49940746c1a6ff910f07dbefbec14af9d0f3c6
|
2024-09-01 22:57:41 +08:00 |
|
hiyouga
|
f31e7e0dfc
|
remove visual_inputs, fix qlora
Former-commit-id: a025c3df61db154bef13033518903bbf846f4fc8
|
2024-08-31 00:24:51 +08:00 |
|
hiyouga
|
a83756b5e9
|
refactor mm training
Former-commit-id: 3382317e32f88ed377d3e7759bdeaf0f2559d22a
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
0e4ee9d9a3
|
update liger kernel
Former-commit-id: a7dd7d325e68c92c7470c1e9ef83a7c8abcbc616
|
2024-08-29 20:46:08 +08:00 |
|
hiyouga
|
f153ee13be
|
fix #5292
Former-commit-id: aa1afdc75614868172bd2f9c052647b8f226d3f2
|
2024-08-29 20:37:47 +08:00 |
|
hiyouga
|
c765292093
|
support liger kernel
Former-commit-id: 72bc8f01111ad69b92a647b54b4af988515d9c34
|
2024-08-27 11:20:14 +08:00 |
|
hiyouga
|
5eacd17090
|
add adam_mini to readme
Former-commit-id: e2a28f51c635d64ff9de65a37087d89356bdedcc
|
2024-08-09 20:02:03 +08:00 |
|
hoshi-hiyouga
|
792da85866
|
Merge pull request #5095 from relic-yuexi/feat-optimizer
Feat optimizer
Former-commit-id: ef482394f0e2820ee8a245f8a6b050a32591b40a
|
2024-08-09 19:51:33 +08:00 |
|
hiyouga
|
b5146facff
|
follow #5115
Former-commit-id: c87023d539875cd8e622d40212a5627c9c182fb8
|
2024-08-09 18:03:00 +08:00 |
|
“Wzw”
|
13e5fff97a
|
mask_history args verify valid
Former-commit-id: 2fa1e0b2add60142c178e5e21ebaad7132fa5b00
|
2024-08-08 10:12:01 +08:00 |
|
moontidef
|
44f7c4dd56
|
feat: add support for adammini
Former-commit-id: 82bc15dc795f95768b81c25eaaabdc613da30cd8
|
2024-08-07 10:08:22 +08:00 |
|
hiyouga
|
542658c986
|
update parser
Former-commit-id: 8f6995081cbdbb2424da586a443e5220a8990faa
|
2024-07-19 01:36:39 +08:00 |
|
hiyouga
|
34f16cc635
|
follow #4878 fix #4684
Former-commit-id: 779aae83d253de0a86201ff87543b5d695e28d23
|
2024-07-18 22:06:12 +08:00 |
|
Shiyu Zhang
|
249adacc4d
|
仅仅训练最后一轮对话
Former-commit-id: 1e7b396ff2489055574fd3365425d26360d73897
|
2024-07-18 15:30:25 +08:00 |
|
hiyouga
|
e90fae61f4
|
support batch_eval_metrics, fix #4826
Former-commit-id: d774b94f124923829b2eae428e25199d503ebfcb
|
2024-07-17 00:33:00 +08:00 |
|
codingma
|
76046dfda8
|
1. change the task name format
2. delete split param in data_args.py
Former-commit-id: 645211dc01b5d4db3ccd0e3dce03a53860eded26
|
2024-07-15 09:55:33 +08:00 |
|
hiyouga
|
22859b8734
|
allow computing rouge in training
Former-commit-id: 99ab7a8c1c966232faa11b6a42b9740d9a20ace3
|
2024-07-15 01:16:26 +08:00 |
|
hiyouga
|
14bc7b0551
|
fix up
Former-commit-id: 29ebcd75d55f70f2891632eba187b643cc3a9e51
|
2024-07-15 01:04:56 +08:00 |
|
hoshi-hiyouga
|
2b22a7da48
|
Merge pull request #4691 from codemayq/feature-suppot-eval-dataset
add eval dataset support
Former-commit-id: 15b399a82f45b08fc07d2957884fb7821eba9fd9
|
2024-07-15 01:00:34 +08:00 |
|
hoshi-hiyouga
|
788dc1c679
|
Update data_args.py
Former-commit-id: cba673f491c5d97aba62aea03f310bd54fb3fe28
|
2024-07-15 00:56:03 +08:00 |
|
hiyouga
|
dfd2d912cd
|
fix #4699
slow tokenizer for yi models
Former-commit-id: 88a20ba7972c533d650967a118d612471fe2b2e8
|
2024-07-14 15:34:22 +08:00 |
|
codingma
|
74f0d02eb8
|
1. add custom eval dataset support
2. merge load dataset and split dataset function
Former-commit-id: 76f3bbcfc0e11aa41f8f5cbebc60b77b987f7901
|
2024-07-05 15:52:10 +08:00 |
|
hiyouga
|
7b3c1f29ff
|
fix packing for eager/sdpa attn
Former-commit-id: 6fd6aa4530f81a2ed306eeb2a5167607288b62c6
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
bfdaadcc40
|
update packing
Former-commit-id: cce7083024bed4c7429ddc8288d1c9190fde29f5
|
2024-07-04 01:10:55 +08:00 |
|
hiyouga
|
ff6fc666c1
|
update hparams
Former-commit-id: 575a02a23d9b41d00ca6291d8a40b5bdb3cbeeec
|
2024-07-03 23:18:58 +08:00 |
|
ancv
|
7f42932957
|
move efficient_packing from data_args to model_args
Former-commit-id: e8e13b09423dd08a31a3bde8f85833c6e5d43ee5
|
2024-07-02 18:37:55 +07:00 |
|
hoshi-hiyouga
|
2452f57cd7
|
Merge branch 'main' into main
Former-commit-id: e8e6af26514272e29a50649b38182beb4db4ebfa
|
2024-07-01 21:01:09 +08:00 |
|
hiyouga
|
ca7b65439d
|
fix #4402 #4617
Deprecate reserved_label_len arg
Former-commit-id: 1771251ce3f6887b301dac10f3de7a253c5e5884
|
2024-07-01 01:19:27 +08:00 |
|
hiyouga
|
b0acd27114
|
increase pissa_iter for stability
Former-commit-id: 64f4337daca4c914d86a7181dd582508688383cd
|
2024-06-28 03:18:54 +08:00 |
|
hiyouga
|
835f0578c2
|
refactor pissa, improve llamaboard
Former-commit-id: 8baf3b22b0fb9624807d809832f097301982d192
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
a294ef2fae
|
fix #4549
Former-commit-id: 8ed6b367e26490acab5d2d7b32f0d5dad449d26a
|
2024-06-28 00:41:58 +08:00 |
|
hiyouga
|
7c488cea57
|
tiny fix
Former-commit-id: e44a4f07f09bbee55c10ccee91dd858256c36054
|
2024-06-27 20:14:48 +08:00 |
|
hiyouga
|
d2d9fa4abb
|
support HQQ/EETQ #4113
Former-commit-id: ad144c2265cdee0d23014dbb3d017ea257cb26ed
|
2024-06-27 00:29:42 +08:00 |
|