hoshi-hiyouga
aaf2e6ba2a
[model] fix kv cache ( #7564 )
2025-04-01 23:07:46 +08:00
Billy Cao
5d1cc863a4
[data] shard the dataset to allow multiprocessing when streaming is enabled ( #7530 )
...
* Shard the dataset when streaming to allow multiprocessing
* Allow user to not set dataset_shards to ensure backward compatibility
2025-04-01 15:36:23 +08:00
hoshi-hiyouga
9ccfb97a2c
[misc] update format ( #7277 )
2025-03-13 02:53:08 +08:00
hoshi-hiyouga
7c1640ed5f
[misc] upgrade format to py39 ( #7256 )
2025-03-12 00:08:41 +08:00
Zhangchi Feng
01915eaf40
[model] support audio ( #6701 )
...
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 24c78429489809873a1269a735ea5421340b32a2
2025-02-05 04:59:09 +08:00
steveepreston
8895cf1152
Update val_size
english description ( #6653 )
...
* Update `val_size` Description in locales.py
* Update `val_size` Description in data_args.py
* Remove extra space in data_args.py
Former-commit-id: 76675b654e243c14b260adbfe04f619e4f2bf177
2025-01-15 16:00:20 +08:00
hiyouga
47c2d91933
support report custom args
...
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
2024-12-21 21:42:45 +00:00
hiyouga
0d18cca0db
add vllm config
...
Former-commit-id: 58ab4579dc81a1dcea2bf5938ba3f3116cecfc76
2024-11-10 21:28:18 +08:00
hiyouga
7fa46a24df
fix #5883
...
Former-commit-id: 24da9f59b0bf4874506bbf1ec214f3d5ca43d943
2024-11-02 13:06:34 +08:00
hiyouga
d5ea05cfff
update get template
...
Former-commit-id: dabad5570bf4a6b1044c963d8f27717030f373ef
2024-09-04 22:36:20 +08:00
hoshi-hiyouga
1dfd1aaf82
Merge pull request #5323 from naem1023/feat/add-dataset-map-batch-size-argument
...
Add batch size of map function in the preprocessed dataset
Former-commit-id: 8f441c2b3a5bb84dec2c037a541084c0201726c6
2024-09-04 22:09:36 +08:00
hiyouga
22deca0e9e
lazy image load
...
Former-commit-id: 47ea97fb1ba77de2e8a561904aa8fdc27c3f5025
2024-09-04 02:27:08 +08:00
naem1023
46695e42cc
feat: add batch size of map function in the preprocessed dataset
...
Former-commit-id: 209313eeeab8d1a7c320bd9aa90a5f4656082b7c
2024-09-02 13:52:47 +09:00
hiyouga
b5146facff
follow #5115
...
Former-commit-id: c87023d539875cd8e622d40212a5627c9c182fb8
2024-08-09 18:03:00 +08:00
“Wzw”
13e5fff97a
mask_history args verify valid
...
Former-commit-id: 2fa1e0b2add60142c178e5e21ebaad7132fa5b00
2024-08-08 10:12:01 +08:00
hiyouga
34f16cc635
follow #4878 fix #4684
...
Former-commit-id: 779aae83d253de0a86201ff87543b5d695e28d23
2024-07-18 22:06:12 +08:00
Shiyu Zhang
249adacc4d
仅仅训练最后一轮对话
...
Former-commit-id: 1e7b396ff2489055574fd3365425d26360d73897
2024-07-18 15:30:25 +08:00
codingma
76046dfda8
1. change the task name format
...
2. delete split param in data_args.py
Former-commit-id: 645211dc01b5d4db3ccd0e3dce03a53860eded26
2024-07-15 09:55:33 +08:00
hoshi-hiyouga
788dc1c679
Update data_args.py
...
Former-commit-id: cba673f491c5d97aba62aea03f310bd54fb3fe28
2024-07-15 00:56:03 +08:00
codingma
74f0d02eb8
1. add custom eval dataset support
...
2. merge load dataset and split dataset function
Former-commit-id: 76f3bbcfc0e11aa41f8f5cbebc60b77b987f7901
2024-07-05 15:52:10 +08:00
hiyouga
7b3c1f29ff
fix packing for eager/sdpa attn
...
Former-commit-id: 6fd6aa4530f81a2ed306eeb2a5167607288b62c6
2024-07-04 01:52:43 +08:00
hiyouga
bfdaadcc40
update packing
...
Former-commit-id: cce7083024bed4c7429ddc8288d1c9190fde29f5
2024-07-04 01:10:55 +08:00
hiyouga
ff6fc666c1
update hparams
...
Former-commit-id: 575a02a23d9b41d00ca6291d8a40b5bdb3cbeeec
2024-07-03 23:18:58 +08:00
ancv
7f42932957
move efficient_packing from data_args to model_args
...
Former-commit-id: e8e13b09423dd08a31a3bde8f85833c6e5d43ee5
2024-07-02 18:37:55 +07:00
hoshi-hiyouga
2452f57cd7
Merge branch 'main' into main
...
Former-commit-id: e8e6af26514272e29a50649b38182beb4db4ebfa
2024-07-01 21:01:09 +08:00
hiyouga
ca7b65439d
fix #4402 #4617
...
Deprecate reserved_label_len arg
Former-commit-id: 1771251ce3f6887b301dac10f3de7a253c5e5884
2024-07-01 01:19:27 +08:00
hiyouga
d519c2fde5
tiny fix
...
Former-commit-id: 41086059b12ecb7827eb390294e315068ff9c2e6
2024-06-25 01:15:19 +08:00
mMrBun
c0e005e2ea
Add tool_format to overwrite tool formatter template
...
Former-commit-id: 20e2e6fdcb0cd1771906be035745a2d9fcd3e138
2024-06-22 02:13:23 +08:00
ancv
5319447aa5
move configure_packing to llamafactory.model.patcher and fix constants
...
Former-commit-id: 770f75dc8363bfa284a72159ff8ad25ec9abe4e0
2024-06-21 00:45:06 +07:00
hiyouga
c0c6b8075a
tiny fix
...
Former-commit-id: 38b6b0f52edeb8ba45aa03b415b3c0c1b0e0c1e4
2024-06-16 01:06:41 +08:00
hiyouga
2946153cea
add license
...
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
2024-06-15 17:54:33 +08:00
ancv
045eb155a2
implement efficient packing without cross-contamination attention
...
Former-commit-id: b2c367bc61c2778dc359613dca496d9e134c2743
2024-06-12 11:56:01 +07:00
hiyouga
cae823ddf0
rename package
...
Former-commit-id: 308edbc4260d45907b4a9d3a45ec21d83e48aacb
2024-05-16 18:39:08 +08:00