hoshi-hiyouga
e1d574a784
[assets] update wechat ( #7161 )
...
Former-commit-id: 0c403ea15b
2025-03-05 14:11:10 +08:00
hoshi-hiyouga
caef0a8937
[data] use bicubic resampler ( #7143 )
...
Former-commit-id: bc298c60b7
2025-03-04 00:17:06 +08:00
hoshi-hiyouga
392533e139
[webui] fix webui ( #7142 )
...
Former-commit-id: 17ba2d5082
2025-03-04 00:01:49 +08:00
rabbit
299cd03785
[data] bailing template ( #7117 )
...
* add bailing template
* add bailing template
* add bailing template
---------
Co-authored-by: chengshiwen.csw@antgroup.com <chengshiwen.csw@antgroup.com >
Former-commit-id: 049ddf48af
2025-03-03 15:33:22 +08:00
hoshi-hiyouga
ee1b580328
[inference] fix hf_engine ( #7120 )
...
Former-commit-id: 1036311826
2025-03-01 05:22:49 +08:00
hoshi-hiyouga
54a090079c
[assets] update wechat ( #7106 )
...
Former-commit-id: d1863bbbaa
2025-02-28 12:01:04 +08:00
Ze-Yi LIN
210cdb9557
[webui] display swanlab exp link ( #7089 )
...
* webui add swanlab link
* change callback name
* update
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 891c487503
2025-02-27 19:40:54 +08:00
leo-pony
e86cb8a4fa
[npu] update cann base image and torch 2.4 ( #7061 )
...
* Update base npu container image version:The Python version required for Hugging Face Transformers is >= python3.10
* Fix the bug: arg type of INSTALL_DEEPSPEED shoud been string now.
* Update Ascend CANN, CANN-Kernel and corresponding torch and torch-npu version
* Upgrade torch-npu needs packages' version: torch==2.1.0 and torch-npu==2.4.0.post2
Former-commit-id: acc52e0fe7
2025-02-25 23:32:01 +08:00
hoshi-hiyouga
f4aa0a146c
[misc] fix project toml ( #7067 )
...
Former-commit-id: 96fd510e6a
2025-02-25 23:22:48 +08:00
JieShen
96636c3729
[script] add seed args ( #7058 )
...
* add seed args
* add seed args
* update seed
Former-commit-id: e8266fe563
2025-02-25 19:44:57 +08:00
Kingsley
81947f1d2c
[model] add paligemma2-mix series ( #7060 )
...
Former-commit-id: 19861d5170
2025-02-25 18:51:16 +08:00
hoshi-hiyouga
dca5fe14c2
[data] fix mllama ( #7053 )
...
* fix mllama
* fix test
Former-commit-id: 76314e6ad1
2025-02-24 22:05:38 +08:00
hoshi-hiyouga
ca78ba964d
[model] add models ( #7054 )
...
* add qwen25vl awq models
* add moonlight
Former-commit-id: ec1a1bc118
2025-02-24 22:05:13 +08:00
hoshi-hiyouga
9359ee18ad
[assets] update readme ( #7051 )
...
Former-commit-id: fe6dd92c84
2025-02-24 20:45:06 +08:00
hoshi-hiyouga
15f3087b96
[assets] update wechat ( #7019 )
...
Former-commit-id: 1481af5dc9
2025-02-20 20:32:33 +08:00
Zhangchi Feng
1fcedf9af6
[data] fix MiniCPMV plugin ( #6998 )
...
* fix template
* fix bug in messages processing
Former-commit-id: cde479e47a
2025-02-19 19:36:04 +08:00
hoshi-hiyouga
b0bbacaacb
[webui] update css ( #6985 )
...
Former-commit-id: 302ecb00fe
2025-02-18 18:27:57 +08:00
hoshi-hiyouga
beb1a9f9d9
[data] add r1 distill dataset ( #6983 )
...
Former-commit-id: 2591a3fa8b
2025-02-18 17:25:09 +08:00
hoshi-hiyouga
3fbd4848e8
[version] support transformers 449 ( #6982 )
...
* support transformers 449
* fix mm plugin
Former-commit-id: b00b290c07
2025-02-18 17:05:40 +08:00
hoshi-hiyouga
184c5d0882
[misc] fix script ( #6977 )
...
Former-commit-id: cc8c7e762b
2025-02-18 17:00:46 +08:00
hoshi-hiyouga
1f4a0b11ba
[data] update vlm args ( #6976 )
...
Former-commit-id: 3da2cc2710
2025-02-18 02:12:51 +08:00
hoshi-hiyouga
b1d31ff0f9
[data] add min resolution option ( #6975 )
...
Former-commit-id: 7faecc0301
2025-02-18 01:40:46 +08:00
hoshi-hiyouga
a8c9d5663d
[data] fix predict dataset ( #6972 )
...
Former-commit-id: bdb581c4a8
2025-02-17 20:29:40 +08:00
hoshi-hiyouga
475a355b82
[assets] update wechat ( #6963 )
...
Former-commit-id: ad0c6c8916
2025-02-17 15:23:17 +08:00
Zhangchi Feng
3dc938268c
[data] fix minicpmo template ( #6946 )
...
Former-commit-id: 2faf8aeff8
2025-02-15 00:37:41 +08:00
Eric Tang
e55ec42d3c
[ray] specify ray storage path ( #6920 )
...
Former-commit-id: 6edd4992d7
2025-02-14 21:55:41 +08:00
hoshi-hiyouga
2baf8bf03d
[misc] fix lora regex ( #6944 )
...
* fix lora regex
* fix
Former-commit-id: 1ada3ae5a3
2025-02-14 21:38:43 +08:00
hoshi-hiyouga
13e1b7ee2b
[misc] fix grad ckpt ( #6931 )
...
Former-commit-id: c31c63b411
2025-02-13 23:27:51 +08:00
hoshi-hiyouga
cd493b91de
[model] add liger kernel to qwen2_5 vl ( #6930 )
...
* add liger kernel to qwen2_5 vl
* fix patch
* fix patch
Former-commit-id: 797043d29c
2025-02-13 23:05:54 +08:00
Billy Cao
48173b606c
[trainer] fix gen_kwarg to eval during training ( #5451 )
...
* Correctly pass gen_kwarg to eval during model runs
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 11eac71c13
2025-02-13 02:35:06 +08:00
SrWYG
0ad9f7f058
[data] evaluate on each dataset ( #5522 )
...
* [Update] loader.py , evaluate will run separate evaluations on each dataset.
`If you pass a dictionary with names of datasets as keys and datasets as values, evaluate will run separate evaluations on each dataset. This can be useful to monitor how training affects other datasets or simply to get a more fine-grained evaluation`
seq2seqtrainner support eval_dataset as Dict.
* fix format
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 1e35967ae1
2025-02-13 02:19:03 +08:00
Noah
1adb46875f
[data] improve error handling ( #6128 )
...
* sync from upstream
* update
* update
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 4c7bfebcf1
2025-02-13 01:39:41 +08:00
hoshi-hiyouga
9b852ebe25
[misc] update readme ( #6918 )
...
Former-commit-id: 8956c93d9b
2025-02-13 01:01:41 +08:00
hoshi-hiyouga
07aa7b71a3
[misc] update readme ( #6917 )
...
Former-commit-id: 499ea45d1f
2025-02-13 00:58:10 +08:00
hoshi-hiyouga
1679930e00
[breaking change] refactor data pipeline ( #6901 )
...
* refactor data
* rename file
Former-commit-id: 617c8ab467
2025-02-13 00:39:20 +08:00
Eric Tang
d50e04b805
[misc] support for launching LLaMA-Factory with uv run ( #6907 )
...
* yay
* uv with ray temporary commit
* remove ray specific code for now
* cleanup
Former-commit-id: f8a206125d
2025-02-13 00:38:44 +08:00
Eric Tang
e515fe62de
[example] fix path to ray example ( #6906 )
...
Former-commit-id: ee5fe216dc
2025-02-13 00:29:32 +08:00
hoshi-hiyouga
036fb0d561
[misc] fix grad ckpt func ( #6916 )
...
Former-commit-id: e34c3c06da
2025-02-13 00:17:18 +08:00
marko1616
bae934dea3
[trainer] fix llama3.2 vision kto train ( #6904 )
...
Former-commit-id: b7fd1e9c00
2025-02-12 19:09:14 +08:00
hoshi-hiyouga
2e2f6bea07
[data] feat: auto template ( #6905 )
...
* support auto template
* add unittest
Former-commit-id: 2f8b6847f5
2025-02-12 00:22:53 +08:00
hoshi-hiyouga
1b02183da9
[misc] update readme ( #6903 )
...
Former-commit-id: 18179a3823
2025-02-11 22:51:26 +08:00
hoshi-hiyouga
197aa3baf4
[data] fix ollama template ( #6902 )
...
* fix ollama template
* add meta info
* use half precision
Former-commit-id: e1a7c1242c
2025-02-11 22:43:09 +08:00
hoshi-hiyouga
c6be9e242c
[misc] support export ollama modelfile ( #6899 )
...
* support export ollama modelfile
* update config
* add system and num ctx
Former-commit-id: 9184a6e0ed
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
2e954d8fd2
[data] refactor template ( #6896 )
...
Former-commit-id: d1b8aa3835
2025-02-11 17:59:25 +08:00
codingma
fafa3add84
support ollama modelfile export ( #4686 )
...
Former-commit-id: 7f354b80bc
2025-02-11 17:52:24 +08:00
hoshi-hiyouga
593acca556
[data] refactor mm plugin ( #6895 )
...
* refactor plugin
* lint
Former-commit-id: aca63bfcca
2025-02-11 16:34:49 +08:00
HJ
188f22d8a7
[data] fix qwen_2_5_vl video processing ( #6868 )
...
* fix qwen_2_5_vl video processing
* Update mm_plugin.py
* Update mm_plugin.py
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 9153a7bd83
2025-02-11 16:14:50 +08:00
hoshi-hiyouga
703bb9cc18
[assets] update wechat ( #6892 )
...
Former-commit-id: fc5d47401f
2025-02-11 13:56:26 +08:00
Zhangchi Feng
5433b318bb
[da'ta] fix minicpmv plugin ( #6890 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
* update init audio
* update init audio
* [model]fix image process in minicpmo
* fix no mm inputs
Former-commit-id: 764627645a
2025-02-11 13:30:44 +08:00
HJ
fe4f4e9758
[data] fix: sharegpt converter ( #6879 )
...
* fix-sharegpt-format
* fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 0fb44cb3a5
2025-02-10 21:59:12 +08:00