hiyouga
|
24da9f59b0
|
fix #5883
|
2024-11-02 13:06:34 +08:00 |
|
hiyouga
|
21db8ed2f4
|
use pre-commit
|
2024-10-29 09:07:46 +00:00 |
|
hiyouga
|
3af57795dd
|
tiny fix
|
2024-10-11 23:51:54 +08:00 |
|
hoshi-hiyouga
|
228dd1739e
|
Merge pull request #5665 from johnnynunez/main
vllm 0.6.3
|
2024-10-11 23:45:58 +08:00 |
|
Johnny
|
e5849cdcce
|
Update parser.py
|
2024-10-11 12:29:33 +02:00 |
|
huniu20
|
7b91be33c9
|
add om_hub_token argument
|
2024-10-10 17:16:46 +08:00 |
|
hoshi-hiyouga
|
1ce0b42b1e
|
Update parser.py
|
2024-10-07 16:27:23 +08:00 |
|
Johnny
|
4e638777eb
|
Update parser.py
|
2024-10-07 10:17:45 +02:00 |
|
Johnny
|
6c1aef5560
|
Update parser.py
|
2024-10-06 20:34:19 +02:00 |
|
hiyouga
|
a45f3f5461
|
fix #5611
|
2024-10-06 10:34:55 +08:00 |
|
hiyouga
|
fe7ffccdb9
|
fix #5542
|
2024-09-30 23:28:55 +08:00 |
|
hiyouga
|
b6681d7198
|
support vllm 0.6.0
|
2024-09-08 02:26:20 +08:00 |
|
hiyouga
|
fb72a3adb0
|
support activation offloading via unsloth gc
|
2024-09-08 01:22:19 +08:00 |
|
hiyouga
|
94d5b1bd8f
|
add e2e tests
|
2024-09-05 21:52:28 +08:00 |
|
hiyouga
|
8cafc7b055
|
video datasets
|
2024-09-05 02:04:17 +08:00 |
|
hiyouga
|
dabad5570b
|
update get template
|
2024-09-04 22:36:20 +08:00 |
|
hoshi-hiyouga
|
8f441c2b3a
|
Merge pull request #5323 from naem1023/feat/add-dataset-map-batch-size-argument
Add batch size of map function in the preprocessed dataset
|
2024-09-04 22:09:36 +08:00 |
|
hiyouga
|
47ea97fb1b
|
lazy image load
|
2024-09-04 02:27:08 +08:00 |
|
hiyouga
|
59d2b31e96
|
fix #5334
|
2024-09-03 19:09:42 +08:00 |
|
naem1023
|
209313eeea
|
feat: add batch size of map function in the preprocessed dataset
|
2024-09-02 13:52:47 +09:00 |
|
hiyouga
|
8e49940746
|
add rlhf-v dataset
|
2024-09-01 22:57:41 +08:00 |
|
hiyouga
|
a025c3df61
|
remove visual_inputs, fix qlora
|
2024-08-31 00:24:51 +08:00 |
|
hiyouga
|
3382317e32
|
refactor mm training
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
a7dd7d325e
|
update liger kernel
|
2024-08-29 20:46:08 +08:00 |
|
hiyouga
|
aa1afdc756
|
fix #5292
|
2024-08-29 20:37:47 +08:00 |
|
hiyouga
|
72bc8f0111
|
support liger kernel
|
2024-08-27 11:20:14 +08:00 |
|
hiyouga
|
e2a28f51c6
|
add adam_mini to readme
|
2024-08-09 20:02:03 +08:00 |
|
hoshi-hiyouga
|
ef482394f0
|
Merge pull request #5095 from relic-yuexi/feat-optimizer
Feat optimizer
|
2024-08-09 19:51:33 +08:00 |
|
hiyouga
|
c87023d539
|
follow #5115
|
2024-08-09 18:03:00 +08:00 |
|
“Wzw”
|
2fa1e0b2ad
|
mask_history args verify valid
|
2024-08-08 10:12:01 +08:00 |
|
moontidef
|
82bc15dc79
|
feat: add support for adammini
|
2024-08-07 10:08:22 +08:00 |
|
hiyouga
|
8f6995081c
|
update parser
|
2024-07-19 01:36:39 +08:00 |
|
hiyouga
|
779aae83d2
|
follow #4878 fix #4684
|
2024-07-18 22:06:12 +08:00 |
|
Shiyu Zhang
|
1e7b396ff2
|
仅仅训练最后一轮对话
|
2024-07-18 15:30:25 +08:00 |
|
hiyouga
|
d774b94f12
|
support batch_eval_metrics, fix #4826
|
2024-07-17 00:33:00 +08:00 |
|
codingma
|
645211dc01
|
1. change the task name format
2. delete split param in data_args.py
|
2024-07-15 09:55:33 +08:00 |
|
hiyouga
|
99ab7a8c1c
|
allow computing rouge in training
|
2024-07-15 01:16:26 +08:00 |
|
hiyouga
|
29ebcd75d5
|
fix up
|
2024-07-15 01:04:56 +08:00 |
|
hoshi-hiyouga
|
15b399a82f
|
Merge pull request #4691 from codemayq/feature-suppot-eval-dataset
add eval dataset support
|
2024-07-15 01:00:34 +08:00 |
|
hoshi-hiyouga
|
cba673f491
|
Update data_args.py
|
2024-07-15 00:56:03 +08:00 |
|
hiyouga
|
88a20ba797
|
fix #4699
slow tokenizer for yi models
|
2024-07-14 15:34:22 +08:00 |
|
codingma
|
76f3bbcfc0
|
1. add custom eval dataset support
2. merge load dataset and split dataset function
|
2024-07-05 15:52:10 +08:00 |
|
hiyouga
|
6fd6aa4530
|
fix packing for eager/sdpa attn
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
cce7083024
|
update packing
|
2024-07-04 01:10:55 +08:00 |
|
hiyouga
|
575a02a23d
|
update hparams
|
2024-07-03 23:18:58 +08:00 |
|
ancv
|
e8e13b0942
|
move efficient_packing from data_args to model_args
|
2024-07-02 18:37:55 +07:00 |
|
hoshi-hiyouga
|
e8e6af2651
|
Merge branch 'main' into main
|
2024-07-01 21:01:09 +08:00 |
|
hiyouga
|
1771251ce3
|
fix #4402 #4617
Deprecate reserved_label_len arg
|
2024-07-01 01:19:27 +08:00 |
|
hiyouga
|
64f4337dac
|
increase pissa_iter for stability
|
2024-06-28 03:18:54 +08:00 |
|
hiyouga
|
8baf3b22b0
|
refactor pissa, improve llamaboard
|
2024-06-28 01:04:24 +08:00 |
|