hoshi-hiyouga
617c8ab467
[breaking change] refactor data pipeline ( #6901 )
...
* refactor data
* rename file
2025-02-13 00:39:20 +08:00
marko1616
b7fd1e9c00
[trainer] fix llama3.2 vision kto train ( #6904 )
2025-02-12 19:09:14 +08:00
hoshi-hiyouga
2f8b6847f5
[data] feat: auto template ( #6905 )
...
* support auto template
* add unittest
2025-02-12 00:22:53 +08:00
hoshi-hiyouga
e1a7c1242c
[data] fix ollama template ( #6902 )
...
* fix ollama template
* add meta info
* use half precision
2025-02-11 22:43:09 +08:00
hoshi-hiyouga
9184a6e0ed
[misc] support export ollama modelfile ( #6899 )
...
* support export ollama modelfile
* update config
* add system and num ctx
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
d1b8aa3835
[data] refactor template ( #6896 )
2025-02-11 17:59:25 +08:00
hoshi-hiyouga
aca63bfcca
[data] refactor mm plugin ( #6895 )
...
* refactor plugin
* lint
2025-02-11 16:34:49 +08:00
HJ
9153a7bd83
[data] fix qwen_2_5_vl video processing ( #6868 )
...
* fix qwen_2_5_vl video processing
* Update mm_plugin.py
* Update mm_plugin.py
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-02-11 16:14:50 +08:00
Zhangchi Feng
764627645a
[data] fix minicpmv plugin ( #6890 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
* update init audio
* update init audio
* [model]fix image process in minicpmo
* fix no mm inputs
2025-02-11 13:30:44 +08:00
HJ
0fb44cb3a5
[data] fix: sharegpt converter ( #6879 )
...
* fix-sharegpt-format
* fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-02-10 21:59:12 +08:00
hoshi-hiyouga
b68199db27
[data] fix mllama collator ( #6874 )
2025-02-09 22:42:25 +08:00
hoshi-hiyouga
f6f3f8d0fc
[test] align test cases ( #6865 )
...
* align test cases
* fix function formatter
2025-02-09 01:03:49 +08:00
hoshi-hiyouga
1356f9d840
[dataset] add openthought ( #6866 )
2025-02-09 00:53:01 +08:00
hoshi-hiyouga
40048ab77a
fix qwen2vl plugin ( #6855 )
2025-02-08 10:59:10 +08:00
Zhangchi Feng
24c7842948
[model] support audio ( #6701 )
...
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-02-05 04:59:09 +08:00
Yueqi Song
a5e943f7bc
[data] allow thought in function call ( #6797 )
...
* Update template.py
* Update template.py
* use formatter
* fix regex
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-02-05 02:26:23 +08:00
hoshi-hiyouga
e2dc5b952a
[misc] update license year & fix llama pro ( #6814 )
...
* fix llamapro script
* change year
2025-02-05 01:53:33 +08:00
Yueqi Song
dd6b7d203e
[data] fix qwen tool template ( #6796 )
...
* Update tool_utils.py
* fix unittest
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-02-05 00:02:00 +08:00
Zhangchi Feng
ab9bd068ef
[data] fix minicpmv plugin ( #6801 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
* update init audio
* update init audio
* [model]fix image process in minicpmo
2025-02-04 21:20:15 +08:00
hoshi-hiyouga
94803d8133
[model] add mistral small models ( #6786 )
2025-02-01 04:31:38 +08:00
hoshi-hiyouga
999c7c8fe0
[model] add qwen2.5 vl models ( #6779 )
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
45e68b9f09
[webui] improve webui & reasoning mode ( #6778 )
2025-01-31 00:09:21 +08:00
hoshi-hiyouga
1f47b6186c
[model] support yarn ( #6693 )
2025-01-18 13:56:09 +08:00
hoshi-hiyouga
c0caa7afc6
[misc] update mm plugin ( #6691 )
2025-01-17 23:04:26 +08:00
Zhangchi Feng
027942789b
[data] Fix minicpmv/o dpo training ( #6657 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
2025-01-15 17:30:37 +08:00
hoshi-hiyouga
1278c3e92e
lint ( #6641 )
2025-01-14 18:40:07 +08:00
Haian Huang(深度眸)
deacc00b12
Support InternLM3 Dense 8B Model ( #6640 )
...
* support internlm3
* update
* update
* update
* add hint
2025-01-14 18:07:27 +08:00
hoshi-hiyouga
98189c8e4d
[model] fix mllama any image ( #6637 )
...
* fix mllama any image
* reorder classes
2025-01-14 16:47:58 +08:00
Zhangchi Feng
c3fda5046d
Support new features of MiniCPM-V ( #6626 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
2025-01-14 00:26:19 +08:00
hoshi-hiyouga
e3e2c8c689
[inference] fix stop token for object detection ( #6624 )
...
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
2025-01-13 21:34:20 +08:00
Zhangchi Feng
3077f20339
Fix template name of MiniCPM-V ( #6620 )
...
* fix template name
* tiny fix
2025-01-13 16:46:48 +08:00
fzc8578
a019cece80
remove tests
2025-01-13 15:08:35 +08:00
fzc8578
0cc7260a93
fix style
2025-01-13 14:19:38 +08:00
fzc8578
cfaa8e4890
fix system prompt and tests
2025-01-13 14:18:06 +08:00
fzc8578
01e9cfd406
add some
2025-01-11 15:03:20 +08:00
fzc8578
7b44f3127e
fix format
2025-01-11 01:27:40 +08:00
fzc8578
a650e114e9
add some
2025-01-11 01:10:24 +08:00
fzc8578
291384dea8
adapt to new mllm_param
2025-01-11 00:16:34 +08:00
Zhangchi Feng
ed0895a9c1
Merge branch 'main' into minicpmv
2025-01-11 00:01:36 +08:00
hiyouga
f6f630a1c9
refactor mllm param logic
2025-01-10 15:45:48 +00:00
fzc8578
771cc80294
add some
2025-01-10 23:29:06 +08:00
fzc8578
ae1f528df3
add some
2025-01-10 21:25:32 +08:00
fzc8578
15bbcdf8d3
fix some
2025-01-10 20:55:52 +08:00
fzc8578
2ee8ba2f39
fix some
2025-01-10 20:27:06 +08:00
Zhangchi Feng
fc045d7dd8
Merge branch 'main' into minicpmv
2025-01-10 20:12:07 +08:00
fzc8578
096a6cb67a
add some
2025-01-10 20:01:22 +08:00
hiyouga
ae16ea755d
improve template, add phi4 model
2025-01-09 18:27:54 +00:00
hiyouga
47e17dd689
improve log
2025-01-08 09:56:10 +00:00
fzc8578
785cc70ff2
add some
2025-01-06 19:32:39 +08:00
Zhangchi Feng
ab87bd6b13
Merge branch 'hiyouga:main' into minicpmv
2025-01-04 11:20:33 +08:00