hoshi-hiyouga
1bb3d17d9e
[data] fix mllama collator (#6874)
...
Former-commit-id: b68199db27
2025-02-09 22:42:25 +08:00
hoshi-hiyouga
b93333685b
[test] align test cases (#6865)
...
* align test cases
* fix function formatter
Former-commit-id: f6f3f8d0fc
2025-02-09 01:03:49 +08:00
hoshi-hiyouga
fcd0f0480d
[dataset] add openthought (#6866)
...
Former-commit-id: 1356f9d840
2025-02-09 00:53:01 +08:00
hoshi-hiyouga
28037c7834
fix qwen2vl plugin (#6855)
...
Former-commit-id: 40048ab77a
2025-02-08 10:59:10 +08:00
Zhangchi Feng
01915eaf40
[model] support audio (#6701)
...
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 24c7842948
2025-02-05 04:59:09 +08:00
Yueqi Song
e665e1fed5
[data] allow thought in function call (#6797)
...
* Update template.py
* Update template.py
* use formatter
* fix regex
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: a5e943f7bc
2025-02-05 02:26:23 +08:00
hoshi-hiyouga
1fee69f874
[misc] update license year & fix llama pro (#6814)
...
* fix llamapro script
* change year
Former-commit-id: e2dc5b952a
2025-02-05 01:53:33 +08:00
Yueqi Song
8504bde893
[data] fix qwen tool template (#6796)
...
* Update tool_utils.py
* fix unittest
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: dd6b7d203e
2025-02-05 00:02:00 +08:00
Zhangchi Feng
85f22d01bf
[data] fix minicpmv plugin (#6801)
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
* update init audio
* update init audio
* [model] fix image process in minicpmo
Former-commit-id: ab9bd068ef
2025-02-04 21:20:15 +08:00
hoshi-hiyouga
445d643ef3
[model] add mistral small models (#6786)
...
Former-commit-id: 94803d8133
2025-02-01 04:31:38 +08:00
hoshi-hiyouga
e8c1979b79
[model] add qwen2.5 vl models (#6779)
...
Former-commit-id: 999c7c8fe0
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
245de012ca
[webui] improve webui & reasoning mode (#6778)
...
Former-commit-id: 45e68b9f09
2025-01-31 00:09:21 +08:00
hoshi-hiyouga
1efe525df7
[model] support yarn (#6693)
...
Former-commit-id: 1f47b6186c
2025-01-18 13:56:09 +08:00
hoshi-hiyouga
f87c788154
[misc] update mm plugin (#6691)
...
Former-commit-id: c0caa7afc6
2025-01-17 23:04:26 +08:00
Zhangchi Feng
555f17c1ee
[data] Fix minicpmv/o dpo training (#6657)
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
Former-commit-id: 027942789b
2025-01-15 17:30:37 +08:00
hoshi-hiyouga
91433d639c
lint (#6641)
...
Former-commit-id: 1278c3e92e
2025-01-14 18:40:07 +08:00
Haian Huang(深度眸)
864ee06243
Support InternLM3 Dense 8B Model (#6640)
...
* support internlm3
* update
* update
* update
* add hint
Former-commit-id: deacc00b12
2025-01-14 18:07:27 +08:00
hoshi-hiyouga
8f73c75c16
[model] fix mllama any image (#6637)
...
* fix mllama any image
* reorder classes
Former-commit-id: 98189c8e4d
2025-01-14 16:47:58 +08:00
Zhangchi Feng
201a495154
Support new features of MiniCPM-V (#6626)
...
* fix template name
* tiny fix
* support minicpm-o-2.6
Former-commit-id: c3fda5046d
2025-01-14 00:26:19 +08:00
hoshi-hiyouga
d8cba9464f
[inference] fix stop token for object detection (#6624)
...
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689
2025-01-13 21:34:20 +08:00
Zhangchi Feng
15bba15725
Fix template name of MiniCPM-V (#6620)
...
* fix template name
* tiny fix
Former-commit-id: 3077f20339
2025-01-13 16:46:48 +08:00
fzc8578
313ce9a576
remove tests
...
Former-commit-id: a019cece80
2025-01-13 15:08:35 +08:00
fzc8578
4741eec2d1
fix style
...
Former-commit-id: 0cc7260a93
2025-01-13 14:19:38 +08:00
fzc8578
d2afe0c63c
fix system prompt and tests
...
Former-commit-id: cfaa8e4890
2025-01-13 14:18:06 +08:00
fzc8578
bdded9d41a
add some
...
Former-commit-id: 01e9cfd406
2025-01-11 15:03:20 +08:00
fzc8578
e7f928adc4
fix format
...
Former-commit-id: 7b44f3127e
2025-01-11 01:27:40 +08:00
fzc8578
62c12a133e
add some
...
Former-commit-id: a650e114e9
2025-01-11 01:10:24 +08:00
fzc8578
08e8499a98
adapt to new mllm_param
...
Former-commit-id: 291384dea8
2025-01-11 00:16:34 +08:00
Zhangchi Feng
d5b18ee4a6
Merge branch 'main' into minicpmv
...
Former-commit-id: ed0895a9c1
2025-01-11 00:01:36 +08:00
hiyouga
c89d17ab63
refactor mllm param logic
...
Former-commit-id: f6f630a1c9
2025-01-10 15:45:48 +00:00
fzc8578
0fb50f9c88
add some
...
Former-commit-id: 771cc80294
2025-01-10 23:29:06 +08:00
fzc8578
bcbe37ff52
add some
...
Former-commit-id: ae1f528df3
2025-01-10 21:25:32 +08:00
fzc8578
994049380d
fix some
...
Former-commit-id: 15bbcdf8d3
2025-01-10 20:55:52 +08:00
fzc8578
7138b43873
fix some
...
Former-commit-id: 2ee8ba2f39
2025-01-10 20:27:06 +08:00
Zhangchi Feng
f51ac40f0a
Merge branch 'main' into minicpmv
...
Former-commit-id: fc045d7dd8
2025-01-10 20:12:07 +08:00
fzc8578
165fe8e219
add some
...
Former-commit-id: 096a6cb67a
2025-01-10 20:01:22 +08:00
hiyouga
b471def13d
improve template, add phi4 model
...
Former-commit-id: ae16ea755d
2025-01-09 18:27:54 +00:00
hiyouga
da542fad18
improve log
...
Former-commit-id: 47e17dd689
2025-01-08 09:56:10 +00:00
fzc8578
b9eeaa9706
add some
...
Former-commit-id: 785cc70ff2
2025-01-06 19:32:39 +08:00
Zhangchi Feng
a0188a430f
Merge branch 'hiyouga:main' into minicpmv
...
Former-commit-id: ab87bd6b13
2025-01-04 11:20:33 +08:00
fzc8578
b5ef5059ee
add some
...
Former-commit-id: 79c2d7090c
2025-01-04 11:11:15 +08:00
hiyouga
da8721a70e
fix #6499
...
Former-commit-id: 1800f8c72d
2025-01-02 11:28:54 +00:00
hiyouga
d0e729cd33
add deepseek3 model
...
Former-commit-id: e67b9dcc3a
2024-12-30 13:39:20 +00:00
hoshi-hiyouga
1178cb0e33
Merge pull request #5507 from piamo/main
...
Add deepseek-v2.5 template
Former-commit-id: 91467ed313
2024-12-30 21:08:25 +08:00
hiyouga
813f5919a3
fix #6482
...
Former-commit-id: 6f5bb3b8e5
2024-12-30 06:03:07 +00:00
hiyouga
3bcb4633ca
fix #6448
...
Former-commit-id: 2719867982
2024-12-27 16:54:39 +00:00
hiyouga
353259f03f
update readme
...
Former-commit-id: 8fd38d273e
2024-12-23 14:08:59 +00:00
hoshi-hiyouga
8265d6a228
Merge pull request #5922 from Tuyohai/main
...
support granite3 models
Former-commit-id: c23a4d0658
2024-12-23 16:46:02 +08:00
hiyouga
433d116080
add paligemma2
...
Former-commit-id: d3509050dc
2024-12-18 08:57:26 +00:00
hoshi-hiyouga
d43080b534
Merge pull request #6313 from ge-xing/main
...
support telechat2 model
Former-commit-id: 015f213788
2024-12-18 16:16:17 +08:00