Kingsley
185c76f6ad
[model] add Qwen2.5-Omni model ( #7537 )
...
* preserve image_sizes
* preserve image_sizes
* init plugin
* support audio-text2text lora
* nit
* support image/video-text2text, audio-text2text
* remove args
* remove lines
* add docs && nit
* remove some comments
* fix && add merge part script
* add license
2025-03-31 20:39:35 +08:00
Kingsley
b00cb2ed42
[data] fix pixtral plugin ( #7505 )
...
* preserve `image_sizes`
* add comments
2025-03-27 17:06:40 +08:00
hoshi-hiyouga
42e090d38b
[trainer] fix vlm loss for transformers 4.49 ( #7448 )
2025-03-24 10:24:05 +08:00
hoshi-hiyouga
ef5f1c1def
[data] gemma3 plugin pan and scan ( #7294 )
...
* gemma3 pan and scan
* add test case
* fix test
2025-03-13 23:29:23 +08:00
hoshi-hiyouga
9ccfb97a2c
[misc] update format ( #7277 )
2025-03-13 02:53:08 +08:00
hoshi-hiyouga
165d3ed084
[model] support gemma3 ( #7273 )
2025-03-13 01:35:23 +08:00
hoshi-hiyouga
7c1640ed5f
[misc] upgrade format to py39 ( #7256 )
2025-03-12 00:08:41 +08:00
hoshi-hiyouga
2b21c749c1
[data] fix mm template ( #7181 )
...
Former-commit-id: be66df1f0211cd2d90eac3ab407dced653c9e443
2025-03-06 15:18:32 +08:00
hoshi-hiyouga
6e58115f98
[trainer] update config ( #7174 )
...
Former-commit-id: b4b89b4ff3bc03aa388569e253d62580755a77a5
2025-03-05 23:32:54 +08:00
sirui.li
8dddffa340
[data] fix qwen2audio plugin ( #7166 )
...
* Update pairwise.py
[data]Repair multimodal model dpo training
* Update pairwise.py
[data]repair multimodal model dpo training using deepcopy
* Update pairwise.py
* Update mm_plugin.py
Former-commit-id: dff4130969bac9cb1abe66fd5dfada8c757c716f
2025-03-05 18:03:36 +08:00
hoshi-hiyouga
caef0a8937
[data] use bicubic resampler ( #7143 )
...
Former-commit-id: bc298c60b7d3fdc4d116a79b535d7e9b11f4aa65
2025-03-04 00:17:06 +08:00
hoshi-hiyouga
dca5fe14c2
[data] fix mllama ( #7053 )
...
* fix mllama
* fix test
Former-commit-id: 76314e6ad1ecaa44fcae4375dd0abf4ebaf1f924
2025-02-24 22:05:38 +08:00
Zhangchi Feng
1fcedf9af6
[data] fix MiniCPMV plugin ( #6998 )
...
* fix template
* fix bug in messages processing
Former-commit-id: cde479e47a51beb60ab555cdee083c1cdba0ead6
2025-02-19 19:36:04 +08:00
hoshi-hiyouga
3fbd4848e8
[version] support transformers 449 ( #6982 )
...
* support transformers 449
* fix mm plugin
Former-commit-id: b00b290c07beb560a5af857ce64f4ce424831a2c
2025-02-18 17:05:40 +08:00
hoshi-hiyouga
184c5d0882
[misc] fix script ( #6977 )
...
Former-commit-id: cc8c7e762b9c873ef79529152465bbed9231053c
2025-02-18 17:00:46 +08:00
hoshi-hiyouga
1f4a0b11ba
[data] update vlm args ( #6976 )
...
Former-commit-id: 3da2cc2710c9b13ab450815a92fff14b03251984
2025-02-18 02:12:51 +08:00
hoshi-hiyouga
b1d31ff0f9
[data] add min resolution option ( #6975 )
...
Former-commit-id: 7faecc0301709326efa21e7a3fdb75fe0a9635c2
2025-02-18 01:40:46 +08:00
hoshi-hiyouga
2baf8bf03d
[misc] fix lora regex ( #6944 )
...
* fix lora regex
* fix
Former-commit-id: 1ada3ae5a3a14057341540c6d6ba985adf95f348
2025-02-14 21:38:43 +08:00
hoshi-hiyouga
1679930e00
[breaking change] refactor data pipeline ( #6901 )
...
* refactor data
* rename file
Former-commit-id: 617c8ab467d32be5f7d5c94fa89c0e3d7d1963bc
2025-02-13 00:39:20 +08:00
hoshi-hiyouga
593acca556
[data] refactor mm plugin ( #6895 )
...
* refactor plugin
* lint
Former-commit-id: aca63bfcca02ecd95b57cd8949a50e26a913f716
2025-02-11 16:34:49 +08:00
HJ
188f22d8a7
[data] fix qwen_2_5_vl video processing ( #6868 )
...
* fix qwen_2_5_vl video processing
* Update mm_plugin.py
* Update mm_plugin.py
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 9153a7bd832cdae84b63a4d7d1f2b12239e84b61
2025-02-11 16:14:50 +08:00
Zhangchi Feng
5433b318bb
[da'ta] fix minicpmv plugin ( #6890 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
* update init audio
* update init audio
* [model]fix image process in minicpmo
* fix no mm inputs
Former-commit-id: 764627645abcd353f9130d5dd8c584810b0e0b1b
2025-02-11 13:30:44 +08:00
hoshi-hiyouga
28037c7834
fix qwen2vl plugin ( #6855 )
...
Former-commit-id: 40048ab77a8b25a91a844800f0f1e880b84548cd
2025-02-08 10:59:10 +08:00
Zhangchi Feng
01915eaf40
[model] support audio ( #6701 )
...
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 24c78429489809873a1269a735ea5421340b32a2
2025-02-05 04:59:09 +08:00
Zhangchi Feng
85f22d01bf
[data] fix minicpmv plugin ( #6801 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
* update init audio
* update init audio
* [model]fix image process in minicpmo
Former-commit-id: ab9bd068efee861452407cdda08ef014d5ce23d5
2025-02-04 21:20:15 +08:00
hoshi-hiyouga
445d643ef3
[model] add mistral small models ( #6786 )
...
Former-commit-id: 94803d8133fbbadff6d224cb6695feb5434fd4fd
2025-02-01 04:31:38 +08:00
hoshi-hiyouga
e8c1979b79
[model] add qwen2.5 vl models ( #6779 )
...
Former-commit-id: 999c7c8fe0caf6b837a1bdc2c6a24fafec327cd8
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
f87c788154
[misc] update mm plugin ( #6691 )
...
Former-commit-id: c0caa7afc60ed3015fe6c263ba3566202ba934f1
2025-01-17 23:04:26 +08:00
hoshi-hiyouga
8f73c75c16
[model] fix mllama any image ( #6637 )
...
* fix mllama any image
* reorder classes
Former-commit-id: 98189c8e4d70bf5f8ee83852a023ed27dfc96900
2025-01-14 16:47:58 +08:00
Zhangchi Feng
201a495154
Support new features of MiniCPM-V ( #6626 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
Former-commit-id: c3fda5046d835ba4542d525b8d89cd12838e9f4c
2025-01-14 00:26:19 +08:00
hoshi-hiyouga
d8cba9464f
[inference] fix stop token for object detection ( #6624 )
...
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689c54ebb2af264de808502e5a8ba0f2b
2025-01-13 21:34:20 +08:00
Zhangchi Feng
15bba15725
Fix template name of MiniCPM-V ( #6620 )
...
* fix template name
* tiny fix
Former-commit-id: 3077f20339158564009270edf79c8ef1b10e8b4a
2025-01-13 16:46:48 +08:00
fzc8578
313ce9a576
remove tests
...
Former-commit-id: a019cece8009b0ba8a6b5a309ed5abfe6cb88a75
2025-01-13 15:08:35 +08:00
fzc8578
4741eec2d1
fix style
...
Former-commit-id: 0cc7260a93bf7c65451e376245aa143f9237d7d8
2025-01-13 14:19:38 +08:00
fzc8578
d2afe0c63c
fix system prompt and tests
...
Former-commit-id: cfaa8e4890ad99ec1fb90d9550503d734b5c30b7
2025-01-13 14:18:06 +08:00
fzc8578
bdded9d41a
add some
...
Former-commit-id: 01e9cfd406dc21f387b4f2baa1d61195a841ccb5
2025-01-11 15:03:20 +08:00
fzc8578
0fb50f9c88
add some
...
Former-commit-id: 771cc802941cf1953b32e5102c817c6a3090b5ce
2025-01-10 23:29:06 +08:00
fzc8578
bcbe37ff52
add some
...
Former-commit-id: ae1f528df31194fe37a123ba1e5a4cd263a61602
2025-01-10 21:25:32 +08:00
fzc8578
994049380d
fix some
...
Former-commit-id: 15bbcdf8d3265f4154d3937719da5e54a5963355
2025-01-10 20:55:52 +08:00
fzc8578
7138b43873
fix some
...
Former-commit-id: 2ee8ba2f390551af1b865cfa813f5c8b7bbb41c5
2025-01-10 20:27:06 +08:00
Zhangchi Feng
f51ac40f0a
Merge branch 'main' into minicpmv
...
Former-commit-id: fc045d7dd871985d621430b5662cba882188a59c
2025-01-10 20:12:07 +08:00
fzc8578
165fe8e219
add some
...
Former-commit-id: 096a6cb67a7dfd14a6e339d96baab78c12d36a87
2025-01-10 20:01:22 +08:00
hiyouga
da542fad18
imporve log
...
Former-commit-id: 47e17dd689840ca9b3c5f34448e5f80265336cca
2025-01-08 09:56:10 +00:00
fzc8578
b9eeaa9706
add some
...
Former-commit-id: 785cc70ff205f5962c3ca67f453589e4a471ba8c
2025-01-06 19:32:39 +08:00
fzc8578
b5ef5059ee
add some
...
Former-commit-id: 79c2d7090cbf364063ea3608814ab18aa27fdc87
2025-01-04 11:11:15 +08:00
hiyouga
88b06a0c7f
support qwen2vl vllm infer
...
Former-commit-id: 207f8b069ca35a28de4588b4962e7254f451c52c
2024-12-05 10:17:26 +00:00
hiyouga
006022cadd
fix mllama cross_mask
...
Former-commit-id: 598c22e43f3f10a335933339cc612744c4835eb0
2024-11-26 15:56:58 +00:00
hoshi-hiyouga
118ffe50e3
lint
...
Former-commit-id: da9e4ddd26ebd6e7eb266aa0bef7505465a6b119
2024-11-25 22:55:56 +08:00
hoshi-hiyouga
c0ffe68745
fix #6139
...
Former-commit-id: d87e16cf5c46dadbfcda7b8ac8edfef6a012f97f
2024-11-25 22:22:06 +08:00
hiyouga
65699c29d4
fix vllm
...
Former-commit-id: 13ee1f5cec815590c5d290f0aca264e6d16ddd5d
2024-11-25 00:07:24 +08:00