Commit Graph

137 Commits

Author SHA1 Message Date
hoshi-hiyouga
be66df1f02 [data] fix mm template (#7181) 2025-03-06 15:18:32 +08:00
rabbit
049ddf48af [data] bailing template (#7117)
* add bailing template

* add bailing template

* add bailing template

---------

Co-authored-by: chengshiwen.csw@antgroup.com <chengshiwen.csw@antgroup.com>
2025-03-03 15:33:22 +08:00
hoshi-hiyouga
ec1a1bc118 [model] add models (#7054)
* add qwen25vl awq models

* add moonlight
2025-02-24 22:05:13 +08:00
Zhangchi Feng
2faf8aeff8 [data] fix minicpmo template (#6946) 2025-02-15 00:37:41 +08:00
hoshi-hiyouga
617c8ab467 [breaking change] refactor data pipeline (#6901)
* refactor data

* rename file
2025-02-13 00:39:20 +08:00
hoshi-hiyouga
2f8b6847f5 [data] feat: auto template (#6905)
* support auto template

* add unittest
2025-02-12 00:22:53 +08:00
hoshi-hiyouga
e1a7c1242c [data] fix ollama template (#6902)
* fix ollama template

* add meta info

* use half precision
2025-02-11 22:43:09 +08:00
hoshi-hiyouga
9184a6e0ed [misc] support export ollama modelfile (#6899)
* support export ollama modelfile

* update config

* add system and num ctx
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
d1b8aa3835 [data] refactor template (#6896) 2025-02-11 17:59:25 +08:00
Zhangchi Feng
764627645a [da'ta] fix minicpmv plugin (#6890)
* fix template name

* tiny fix

* support minicpm-o-2.6

* support inference of minicpmv

* update readme

* support dpo of minicpmv

* update init audio

* update init audio

* [model]fix image process in minicpmo

* fix no mm inputs
2025-02-11 13:30:44 +08:00
hoshi-hiyouga
1356f9d840 [dataset] add openthought (#6866) 2025-02-09 00:53:01 +08:00
Zhangchi Feng
24c7842948 [model] support audio (#6701)
* support qwen2_audio

* improve code

* lint

* fix

* fix

* fix

---------

Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-02-05 04:59:09 +08:00
hoshi-hiyouga
e2dc5b952a [misc] update license year & fix llama pro (#6814)
* fix llamapro script

* change year
2025-02-05 01:53:33 +08:00
hoshi-hiyouga
94803d8133 [model] add mistral small models (#6786) 2025-02-01 04:31:38 +08:00
hoshi-hiyouga
45e68b9f09 [webui] improve webui & reasoning mode (#6778) 2025-01-31 00:09:21 +08:00
hoshi-hiyouga
1278c3e92e lint (#6641) 2025-01-14 18:40:07 +08:00
Haian Huang(深度眸)
deacc00b12 Support InternLM3 Dense 8B Model (#6640)
* support internlm3

* update

* update

* update

* add hint
2025-01-14 18:07:27 +08:00
hoshi-hiyouga
98189c8e4d [model] fix mllama any image (#6637)
* fix mllama any image

* reorder classes
2025-01-14 16:47:58 +08:00
Zhangchi Feng
c3fda5046d Support new features of MiniCPM-V (#6626)
* fix template name

* tiny fix

* support minicpm-o-2.6
2025-01-14 00:26:19 +08:00
hoshi-hiyouga
e3e2c8c689 [inference] fix stop token for object detection (#6624)
* fix stop token

* update minicpm data pipeline

* fix npu qlora examples
2025-01-13 21:34:20 +08:00
Zhangchi Feng
3077f20339 Fix template name of MiniCPM-V (#6620)
* fix template name

* tiny fix
2025-01-13 16:46:48 +08:00
fzc8578
cfaa8e4890 fix system prompt and tests 2025-01-13 14:18:06 +08:00
Zhangchi Feng
ed0895a9c1 Merge branch 'main' into minicpmv 2025-01-11 00:01:36 +08:00
hiyouga
f6f630a1c9 refactor mllm param logic 2025-01-10 15:45:48 +00:00
fzc8578
2ee8ba2f39 fix some 2025-01-10 20:27:06 +08:00
Zhangchi Feng
fc045d7dd8 Merge branch 'main' into minicpmv 2025-01-10 20:12:07 +08:00
hiyouga
ae16ea755d improve template, add phi4 model 2025-01-09 18:27:54 +00:00
hiyouga
47e17dd689 imporve log 2025-01-08 09:56:10 +00:00
fzc8578
79c2d7090c add some 2025-01-04 11:11:15 +08:00
hiyouga
e67b9dcc3a add deepseek3 model 2024-12-30 13:39:20 +00:00
hoshi-hiyouga
91467ed313 Merge pull request #5507 from piamo/main
Add deepseek-v2.5 template
2024-12-30 21:08:25 +08:00
hiyouga
8fd38d273e update readme 2024-12-23 14:08:59 +00:00
hoshi-hiyouga
c23a4d0658 Merge pull request #5922 from Tuyohai/main
support granite3 models
2024-12-23 16:46:02 +08:00
hiyouga
d3509050dc add paligemma2 2024-12-18 08:57:26 +00:00
hoshi-hiyouga
015f213788 Merge pull request #6313 from ge-xing/main
support telechat2 model
2024-12-18 16:16:17 +08:00
hiyouga
98795854e3 support qwen tool format 2024-12-17 20:12:06 +00:00
hiyouga
bcc413cf64 change default replace jinja to false 2024-12-17 19:27:10 +00:00
ylfeng
115924af47 Support Mistral format tools 2024-12-17 19:13:26 +00:00
hiyouga
df5655f61c fix llama3 tool template 2024-12-17 17:05:10 +00:00
hiyouga
b24ae55ebf support llama3 tool prompt 2024-12-17 15:52:37 +00:00
zhaohu xing
04f19ed0f3 support telechat2 model 2024-12-17 12:15:33 +00:00
hiyouga
046b6fb118 fix dataset 2024-11-27 06:27:44 +00:00
hiyouga
ec9ff8caa2 add skywork o1 2024-11-27 05:51:59 +00:00
hiyouga
17afb7d410 add marco-o1 and openo1 dataset 2024-11-27 04:20:23 +00:00
hiyouga
446441fdb0 fix inputs 2024-11-23 18:26:02 +00:00
marko1616
3f2c056253 Support llama3.2vl. 2024-11-23 16:07:35 +00:00
hiyouga
431ac4892c add qwen-coder and opencoder 2024-11-15 21:48:38 +08:00
steven
6eefb4d7d2 support granite3 models 2024-11-04 10:35:03 +08:00
hoshi-hiyouga
478cbb1aa7 update template 2024-11-02 21:21:22 +08:00
hoshi-hiyouga
5f14910910 Merge branch 'main' into main 2024-11-02 21:20:27 +08:00