Commit Graph

114 Commits

Author SHA1 Message Date
hiyouga
e99031daa4 fix inputs
Former-commit-id: 446441fdb0
2024-11-23 18:26:02 +00:00
Ting
e27a0c3d53 code refactor
Former-commit-id: 40627c601e
2024-11-19 20:33:18 +08:00
hiyouga
0d18cca0db add vllm config
Former-commit-id: 58ab4579dc
2024-11-10 21:28:18 +08:00
hiyouga
e83cb17f97 support rank0 logger
Former-commit-id: c38aa29336
2024-11-02 18:31:04 +08:00
hiyouga
7fa46a24df fix #5883
Former-commit-id: 24da9f59b0
2024-11-02 13:06:34 +08:00
hiyouga
0d8aa6e6ef use pre-commit
Former-commit-id: 21db8ed2f4
2024-10-29 09:07:46 +00:00
hiyouga
e90a1199da tiny fix
Former-commit-id: 3af57795dd
2024-10-11 23:51:54 +08:00
hoshi-hiyouga
012f4fef6b Merge pull request #5665 from johnnynunez/main
vllm 0.6.3

Former-commit-id: 228dd1739e
2024-10-11 23:45:58 +08:00
Johnny
27be1e2122 Update parser.py
Former-commit-id: e5849cdcce
2024-10-11 12:29:33 +02:00
huniu20
e8e98bb125 add om_hub_token argument
Former-commit-id: 7b91be33c9
2024-10-10 17:16:46 +08:00
hoshi-hiyouga
b855d3421e Update parser.py
Former-commit-id: 1ce0b42b1e
2024-10-07 16:27:23 +08:00
Johnny
059c2ffbea Update parser.py
Former-commit-id: 4e638777eb
2024-10-07 10:17:45 +02:00
Johnny
9a6045eee6 Update parser.py
Former-commit-id: 6c1aef5560
2024-10-06 20:34:19 +02:00
hiyouga
56132983cf fix #5611
Former-commit-id: a45f3f5461
2024-10-06 10:34:55 +08:00
hiyouga
4df090ff48 fix #5542
Former-commit-id: fe7ffccdb9
2024-09-30 23:28:55 +08:00
hiyouga
78cf256067 support vllm 0.6.0
Former-commit-id: b6681d7198
2024-09-08 02:26:20 +08:00
hiyouga
0daee7cb39 support activation offloading via unsloth gc
Former-commit-id: fb72a3adb0
2024-09-08 01:22:19 +08:00
hiyouga
3aa6a3e45b add e2e tests
Former-commit-id: 94d5b1bd8f
2024-09-05 21:52:28 +08:00
hiyouga
9df7a26e6b video datasets
Former-commit-id: 8cafc7b055
2024-09-05 02:04:17 +08:00
hiyouga
d5ea05cfff update get template
Former-commit-id: dabad5570b
2024-09-04 22:36:20 +08:00
hoshi-hiyouga
1dfd1aaf82 Merge pull request #5323 from naem1023/feat/add-dataset-map-batch-size-argument
Add batch size of map function in the preprocessed dataset

Former-commit-id: 8f441c2b3a
2024-09-04 22:09:36 +08:00
hiyouga
22deca0e9e lazy image load
Former-commit-id: 47ea97fb1b
2024-09-04 02:27:08 +08:00
hiyouga
5ef58eb655 fix #5334
Former-commit-id: 59d2b31e96
2024-09-03 19:09:42 +08:00
naem1023
46695e42cc feat: add batch size of map function in the preprocessed dataset
Former-commit-id: 209313eeea
2024-09-02 13:52:47 +09:00
hiyouga
bfdcc6bacf add rlhf-v dataset
Former-commit-id: 8e49940746
2024-09-01 22:57:41 +08:00
hiyouga
f31e7e0dfc remove visual_inputs, fix qlora
Former-commit-id: a025c3df61
2024-08-31 00:24:51 +08:00
hiyouga
a83756b5e9 refactor mm training
Former-commit-id: 3382317e32
2024-08-30 02:14:31 +08:00
hiyouga
0e4ee9d9a3 update liger kernel
Former-commit-id: a7dd7d325e
2024-08-29 20:46:08 +08:00
hiyouga
f153ee13be fix #5292
Former-commit-id: aa1afdc756
2024-08-29 20:37:47 +08:00
hiyouga
c765292093 support liger kernel
Former-commit-id: 72bc8f0111
2024-08-27 11:20:14 +08:00
hiyouga
5eacd17090 add adam_mini to readme
Former-commit-id: e2a28f51c6
2024-08-09 20:02:03 +08:00
hoshi-hiyouga
792da85866 Merge pull request #5095 from relic-yuexi/feat-optimizer
Feat optimizer

Former-commit-id: ef482394f0
2024-08-09 19:51:33 +08:00
hiyouga
b5146facff follow #5115
Former-commit-id: c87023d539
2024-08-09 18:03:00 +08:00
“Wzw”
13e5fff97a mask_history args verify valid
Former-commit-id: 2fa1e0b2ad
2024-08-08 10:12:01 +08:00
moontidef
44f7c4dd56 feat: add support for adammini
Former-commit-id: 82bc15dc79
2024-08-07 10:08:22 +08:00
hiyouga
542658c986 update parser
Former-commit-id: 8f6995081c
2024-07-19 01:36:39 +08:00
hiyouga
34f16cc635 follow #4878 fix #4684
Former-commit-id: 779aae83d2
2024-07-18 22:06:12 +08:00
Shiyu Zhang
249adacc4d 仅仅训练最后一轮对话
Former-commit-id: 1e7b396ff2
2024-07-18 15:30:25 +08:00
hiyouga
e90fae61f4 support batch_eval_metrics, fix #4826
Former-commit-id: d774b94f12
2024-07-17 00:33:00 +08:00
codingma
76046dfda8 1. change the task name format
2. delete split param in data_args.py


Former-commit-id: 645211dc01
2024-07-15 09:55:33 +08:00
hiyouga
22859b8734 allow computing rouge in training
Former-commit-id: 99ab7a8c1c
2024-07-15 01:16:26 +08:00
hiyouga
14bc7b0551 fix up
Former-commit-id: 29ebcd75d5
2024-07-15 01:04:56 +08:00
hoshi-hiyouga
2b22a7da48 Merge pull request #4691 from codemayq/feature-suppot-eval-dataset
add eval dataset support

Former-commit-id: 15b399a82f
2024-07-15 01:00:34 +08:00
hoshi-hiyouga
788dc1c679 Update data_args.py
Former-commit-id: cba673f491
2024-07-15 00:56:03 +08:00
hiyouga
dfd2d912cd fix #4699
slow tokenizer for yi models


Former-commit-id: 88a20ba797
2024-07-14 15:34:22 +08:00
codingma
74f0d02eb8 1. add custom eval dataset support
2. merge load dataset and split dataset function


Former-commit-id: 76f3bbcfc0
2024-07-05 15:52:10 +08:00
hiyouga
7b3c1f29ff fix packing for eager/sdpa attn
Former-commit-id: 6fd6aa4530
2024-07-04 01:52:43 +08:00
hiyouga
bfdaadcc40 update packing
Former-commit-id: cce7083024
2024-07-04 01:10:55 +08:00
hiyouga
ff6fc666c1 update hparams
Former-commit-id: 575a02a23d
2024-07-03 23:18:58 +08:00
ancv
7f42932957 move efficient_packing from data_args to model_args
Former-commit-id: e8e13b0942
2024-07-02 18:37:55 +07:00