Commit Graph

110 Commits

Author SHA1 Message Date
hiyouga
24da9f59b0 fix #5883 2024-11-02 13:06:34 +08:00
hiyouga
21db8ed2f4 use pre-commit 2024-10-29 09:07:46 +00:00
hiyouga
3af57795dd tiny fix 2024-10-11 23:51:54 +08:00
hoshi-hiyouga
228dd1739e Merge pull request #5665 from johnnynunez/main
vllm 0.6.3
2024-10-11 23:45:58 +08:00
Johnny
e5849cdcce Update parser.py 2024-10-11 12:29:33 +02:00
huniu20
7b91be33c9 add om_hub_token argument 2024-10-10 17:16:46 +08:00
hoshi-hiyouga
1ce0b42b1e Update parser.py 2024-10-07 16:27:23 +08:00
Johnny
4e638777eb Update parser.py 2024-10-07 10:17:45 +02:00
Johnny
6c1aef5560 Update parser.py 2024-10-06 20:34:19 +02:00
hiyouga
a45f3f5461 fix #5611 2024-10-06 10:34:55 +08:00
hiyouga
fe7ffccdb9 fix #5542 2024-09-30 23:28:55 +08:00
hiyouga
b6681d7198 support vllm 0.6.0 2024-09-08 02:26:20 +08:00
hiyouga
fb72a3adb0 support activation offloading via unsloth gc 2024-09-08 01:22:19 +08:00
hiyouga
94d5b1bd8f add e2e tests 2024-09-05 21:52:28 +08:00
hiyouga
8cafc7b055 video datasets 2024-09-05 02:04:17 +08:00
hiyouga
dabad5570b update get template 2024-09-04 22:36:20 +08:00
hoshi-hiyouga
8f441c2b3a Merge pull request #5323 from naem1023/feat/add-dataset-map-batch-size-argument
Add batch size of map function in the preprocessed dataset
2024-09-04 22:09:36 +08:00
hiyouga
47ea97fb1b lazy image load 2024-09-04 02:27:08 +08:00
hiyouga
59d2b31e96 fix #5334 2024-09-03 19:09:42 +08:00
naem1023
209313eeea feat: add batch size of map function in the preprocessed dataset 2024-09-02 13:52:47 +09:00
hiyouga
8e49940746 add rlhf-v dataset 2024-09-01 22:57:41 +08:00
hiyouga
a025c3df61 remove visual_inputs, fix qlora 2024-08-31 00:24:51 +08:00
hiyouga
3382317e32 refactor mm training 2024-08-30 02:14:31 +08:00
hiyouga
a7dd7d325e update liger kernel 2024-08-29 20:46:08 +08:00
hiyouga
aa1afdc756 fix #5292 2024-08-29 20:37:47 +08:00
hiyouga
72bc8f0111 support liger kernel 2024-08-27 11:20:14 +08:00
hiyouga
e2a28f51c6 add adam_mini to readme 2024-08-09 20:02:03 +08:00
hoshi-hiyouga
ef482394f0 Merge pull request #5095 from relic-yuexi/feat-optimizer
Feat optimizer
2024-08-09 19:51:33 +08:00
hiyouga
c87023d539 follow #5115 2024-08-09 18:03:00 +08:00
“Wzw”
2fa1e0b2ad mask_history args verify valid 2024-08-08 10:12:01 +08:00
moontidef
82bc15dc79 feat: add support for adammini 2024-08-07 10:08:22 +08:00
hiyouga
8f6995081c update parser 2024-07-19 01:36:39 +08:00
hiyouga
779aae83d2 follow #4878 fix #4684 2024-07-18 22:06:12 +08:00
Shiyu Zhang
1e7b396ff2 仅仅训练最后一轮对话 2024-07-18 15:30:25 +08:00
hiyouga
d774b94f12 support batch_eval_metrics, fix #4826 2024-07-17 00:33:00 +08:00
codingma
645211dc01 1. change the task name format
2. delete split param in data_args.py
2024-07-15 09:55:33 +08:00
hiyouga
99ab7a8c1c allow computing rouge in training 2024-07-15 01:16:26 +08:00
hiyouga
29ebcd75d5 fix up 2024-07-15 01:04:56 +08:00
hoshi-hiyouga
15b399a82f Merge pull request #4691 from codemayq/feature-suppot-eval-dataset
add eval dataset support
2024-07-15 01:00:34 +08:00
hoshi-hiyouga
cba673f491 Update data_args.py 2024-07-15 00:56:03 +08:00
hiyouga
88a20ba797 fix #4699
slow tokenizer for yi models
2024-07-14 15:34:22 +08:00
codingma
76f3bbcfc0 1. add custom eval dataset support
2. merge load dataset and split dataset function
2024-07-05 15:52:10 +08:00
hiyouga
6fd6aa4530 fix packing for eager/sdpa attn 2024-07-04 01:52:43 +08:00
hiyouga
cce7083024 update packing 2024-07-04 01:10:55 +08:00
hiyouga
575a02a23d update hparams 2024-07-03 23:18:58 +08:00
ancv
e8e13b0942 move efficient_packing from data_args to model_args 2024-07-02 18:37:55 +07:00
hoshi-hiyouga
e8e6af2651 Merge branch 'main' into main 2024-07-01 21:01:09 +08:00
hiyouga
1771251ce3 fix #4402 #4617
Deprecate reserved_label_len arg
2024-07-01 01:19:27 +08:00
hiyouga
64f4337dac increase pissa_iter for stability 2024-06-28 03:18:54 +08:00
hiyouga
8baf3b22b0 refactor pissa, improve llamaboard 2024-06-28 01:04:24 +08:00