278 Commits

Author SHA1 Message Date
hoshi-hiyouga
45e68b9f09 [webui] improve webui & reasoning mode (#6778) 2025-01-31 00:09:21 +08:00
hoshi-hiyouga
1f47b6186c [model] support yarn (#6693) 2025-01-18 13:56:09 +08:00
hoshi-hiyouga
c0caa7afc6 [misc] update mm plugin (#6691) 2025-01-17 23:04:26 +08:00
Zhangchi Feng
027942789b [data] Fix minicpmv/o dpo training (#6657)
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
2025-01-15 17:30:37 +08:00
hoshi-hiyouga
1278c3e92e lint (#6641) 2025-01-14 18:40:07 +08:00
Haian Huang(深度眸)
deacc00b12 Support InternLM3 Dense 8B Model (#6640)
* support internlm3
* update
* update
* update
* add hint
2025-01-14 18:07:27 +08:00
hoshi-hiyouga
98189c8e4d [model] fix mllama any image (#6637)
* fix mllama any image
* reorder classes
2025-01-14 16:47:58 +08:00
Zhangchi Feng
c3fda5046d Support new features of MiniCPM-V (#6626)
* fix template name
* tiny fix
* support minicpm-o-2.6
2025-01-14 00:26:19 +08:00
hoshi-hiyouga
e3e2c8c689 [inference] fix stop token for object detection (#6624)
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
2025-01-13 21:34:20 +08:00
Zhangchi Feng
3077f20339 Fix template name of MiniCPM-V (#6620)
* fix template name
* tiny fix
2025-01-13 16:46:48 +08:00
fzc8578
a019cece80 remove tests 2025-01-13 15:08:35 +08:00
fzc8578
0cc7260a93 fix style 2025-01-13 14:19:38 +08:00
fzc8578
cfaa8e4890 fix system prompt and tests 2025-01-13 14:18:06 +08:00
fzc8578
01e9cfd406 add some 2025-01-11 15:03:20 +08:00
fzc8578
7b44f3127e fix format 2025-01-11 01:27:40 +08:00
fzc8578
a650e114e9 add some 2025-01-11 01:10:24 +08:00
fzc8578
291384dea8 adapt to new mllm_param 2025-01-11 00:16:34 +08:00
Zhangchi Feng
ed0895a9c1 Merge branch 'main' into minicpmv 2025-01-11 00:01:36 +08:00
hiyouga
f6f630a1c9 refactor mllm param logic 2025-01-10 15:45:48 +00:00
fzc8578
771cc80294 add some 2025-01-10 23:29:06 +08:00
fzc8578
ae1f528df3 add some 2025-01-10 21:25:32 +08:00
fzc8578
15bbcdf8d3 fix some 2025-01-10 20:55:52 +08:00
fzc8578
2ee8ba2f39 fix some 2025-01-10 20:27:06 +08:00
Zhangchi Feng
fc045d7dd8 Merge branch 'main' into minicpmv 2025-01-10 20:12:07 +08:00
fzc8578
096a6cb67a add some 2025-01-10 20:01:22 +08:00
hiyouga
ae16ea755d improve template, add phi4 model 2025-01-09 18:27:54 +00:00
hiyouga
47e17dd689 improve log 2025-01-08 09:56:10 +00:00
fzc8578
785cc70ff2 add some 2025-01-06 19:32:39 +08:00
Zhangchi Feng
ab87bd6b13 Merge branch 'hiyouga:main' into minicpmv 2025-01-04 11:20:33 +08:00
fzc8578
79c2d7090c add some 2025-01-04 11:11:15 +08:00
hiyouga
1800f8c72d fix #6499 2025-01-02 11:28:54 +00:00
hiyouga
e67b9dcc3a add deepseek3 model 2024-12-30 13:39:20 +00:00
hoshi-hiyouga
91467ed313 Merge pull request #5507 from piamo/main
Add deepseek-v2.5 template
2024-12-30 21:08:25 +08:00
hiyouga
6f5bb3b8e5 fix #6482 2024-12-30 06:03:07 +00:00
hiyouga
2719867982 fix #6448 2024-12-27 16:54:39 +00:00
hiyouga
8fd38d273e update readme 2024-12-23 14:08:59 +00:00
hoshi-hiyouga
c23a4d0658 Merge pull request #5922 from Tuyohai/main
support granite3 models
2024-12-23 16:46:02 +08:00
hiyouga
d3509050dc add paligemma2 2024-12-18 08:57:26 +00:00
hoshi-hiyouga
015f213788 Merge pull request #6313 from ge-xing/main
support telechat2 model
2024-12-18 16:16:17 +08:00
hiyouga
98795854e3 support qwen tool format 2024-12-17 20:12:06 +00:00
hiyouga
bcc413cf64 change default replace jinja to false 2024-12-17 19:27:10 +00:00
ylfeng
115924af47 Support Mistral format tools 2024-12-17 19:13:26 +00:00
hiyouga
df5655f61c fix llama3 tool template 2024-12-17 17:05:10 +00:00
hoshi-hiyouga
e12c80ace8 Merge pull request #6367 from hiyouga/hiyouga/add_model
[model&template] add llama3.3 & support llama3 tool prompt
2024-12-18 00:13:28 +08:00
hiyouga
b24ae55ebf support llama3 tool prompt 2024-12-17 15:52:37 +00:00
Yaser Afshar
0943776326 Add trust_remote_code parameter and remove True
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
  to enhance security
2024-12-17 12:25:12 +00:00
zhaohu xing
04f19ed0f3 support telechat2 model 2024-12-17 12:15:33 +00:00
hiyouga
142191e466 fix #6348 2024-12-17 10:06:46 +00:00
hiyouga
2811814fc4 fix mrope 2024-12-12 15:08:17 +00:00
hiyouga
207f8b069c support qwen2vl vllm infer 2024-12-05 10:17:26 +00:00