hoshi-hiyouga
|
45e68b9f09
|
[webui] improve webui & reasoning mode (#6778)
|
2025-01-31 00:09:21 +08:00 |
|
hoshi-hiyouga
|
1f47b6186c
|
[model] support yarn (#6693)
|
2025-01-18 13:56:09 +08:00 |
|
hoshi-hiyouga
|
c0caa7afc6
|
[misc] update mm plugin (#6691)
|
2025-01-17 23:04:26 +08:00 |
|
Zhangchi Feng
|
027942789b
|
[data] Fix minicpmv/o dpo training (#6657)
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
|
2025-01-15 17:30:37 +08:00 |
|
hoshi-hiyouga
|
1278c3e92e
|
lint (#6641)
|
2025-01-14 18:40:07 +08:00 |
|
Haian Huang(深度眸)
|
deacc00b12
|
Support InternLM3 Dense 8B Model (#6640)
* support internlm3
* update
* update
* update
* add hint
|
2025-01-14 18:07:27 +08:00 |
|
hoshi-hiyouga
|
98189c8e4d
|
[model] fix mllama any image (#6637)
* fix mllama any image
* reorder classes
|
2025-01-14 16:47:58 +08:00 |
|
Zhangchi Feng
|
c3fda5046d
|
Support new features of MiniCPM-V (#6626)
* fix template name
* tiny fix
* support minicpm-o-2.6
|
2025-01-14 00:26:19 +08:00 |
|
hoshi-hiyouga
|
e3e2c8c689
|
[inference] fix stop token for object detection (#6624)
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
|
2025-01-13 21:34:20 +08:00 |
|
Zhangchi Feng
|
3077f20339
|
Fix template name of MiniCPM-V (#6620)
* fix template name
* tiny fix
|
2025-01-13 16:46:48 +08:00 |
|
fzc8578
|
a019cece80
|
remove tests
|
2025-01-13 15:08:35 +08:00 |
|
fzc8578
|
0cc7260a93
|
fix style
|
2025-01-13 14:19:38 +08:00 |
|
fzc8578
|
cfaa8e4890
|
fix system prompt and tests
|
2025-01-13 14:18:06 +08:00 |
|
fzc8578
|
01e9cfd406
|
add some
|
2025-01-11 15:03:20 +08:00 |
|
fzc8578
|
7b44f3127e
|
fix format
|
2025-01-11 01:27:40 +08:00 |
|
fzc8578
|
a650e114e9
|
add some
|
2025-01-11 01:10:24 +08:00 |
|
fzc8578
|
291384dea8
|
adapt to new mllm_param
|
2025-01-11 00:16:34 +08:00 |
|
Zhangchi Feng
|
ed0895a9c1
|
Merge branch 'main' into minicpmv
|
2025-01-11 00:01:36 +08:00 |
|
hiyouga
|
f6f630a1c9
|
refactor mllm param logic
|
2025-01-10 15:45:48 +00:00 |
|
fzc8578
|
771cc80294
|
add some
|
2025-01-10 23:29:06 +08:00 |
|
fzc8578
|
ae1f528df3
|
add some
|
2025-01-10 21:25:32 +08:00 |
|
fzc8578
|
15bbcdf8d3
|
fix some
|
2025-01-10 20:55:52 +08:00 |
|
fzc8578
|
2ee8ba2f39
|
fix some
|
2025-01-10 20:27:06 +08:00 |
|
Zhangchi Feng
|
fc045d7dd8
|
Merge branch 'main' into minicpmv
|
2025-01-10 20:12:07 +08:00 |
|
fzc8578
|
096a6cb67a
|
add some
|
2025-01-10 20:01:22 +08:00 |
|
hiyouga
|
ae16ea755d
|
improve template, add phi4 model
|
2025-01-09 18:27:54 +00:00 |
|
hiyouga
|
47e17dd689
|
imporve log
|
2025-01-08 09:56:10 +00:00 |
|
fzc8578
|
785cc70ff2
|
add some
|
2025-01-06 19:32:39 +08:00 |
|
Zhangchi Feng
|
ab87bd6b13
|
Merge branch 'hiyouga:main' into minicpmv
|
2025-01-04 11:20:33 +08:00 |
|
fzc8578
|
79c2d7090c
|
add some
|
2025-01-04 11:11:15 +08:00 |
|
hiyouga
|
1800f8c72d
|
fix #6499
|
2025-01-02 11:28:54 +00:00 |
|
hiyouga
|
e67b9dcc3a
|
add deepseek3 model
|
2024-12-30 13:39:20 +00:00 |
|
hoshi-hiyouga
|
91467ed313
|
Merge pull request #5507 from piamo/main
Add deepseek-v2.5 template
|
2024-12-30 21:08:25 +08:00 |
|
hiyouga
|
6f5bb3b8e5
|
fix #6482
|
2024-12-30 06:03:07 +00:00 |
|
hiyouga
|
2719867982
|
fix #6448
|
2024-12-27 16:54:39 +00:00 |
|
hiyouga
|
8fd38d273e
|
update readme
|
2024-12-23 14:08:59 +00:00 |
|
hoshi-hiyouga
|
c23a4d0658
|
Merge pull request #5922 from Tuyohai/main
support granite3 models
|
2024-12-23 16:46:02 +08:00 |
|
hiyouga
|
d3509050dc
|
add paligemma2
|
2024-12-18 08:57:26 +00:00 |
|
hoshi-hiyouga
|
015f213788
|
Merge pull request #6313 from ge-xing/main
support telechat2 model
|
2024-12-18 16:16:17 +08:00 |
|
hiyouga
|
98795854e3
|
support qwen tool format
|
2024-12-17 20:12:06 +00:00 |
|
hiyouga
|
bcc413cf64
|
change default replace jinja to false
|
2024-12-17 19:27:10 +00:00 |
|
ylfeng
|
115924af47
|
Support Mistral format tools
|
2024-12-17 19:13:26 +00:00 |
|
hiyouga
|
df5655f61c
|
fix llama3 tool template
|
2024-12-17 17:05:10 +00:00 |
|
hoshi-hiyouga
|
e12c80ace8
|
Merge pull request #6367 from hiyouga/hiyouga/add_model
[model&template] add llama3.3 & support llama3 tool prompt
|
2024-12-18 00:13:28 +08:00 |
|
hiyouga
|
b24ae55ebf
|
support llama3 tool prompt
|
2024-12-17 15:52:37 +00:00 |
|
Yaser Afshar
|
0943776326
|
Add trust_remote_code parameter and remove True
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
to enhance security
|
2024-12-17 12:25:12 +00:00 |
|
zhaohu xing
|
04f19ed0f3
|
support telechat2 model
|
2024-12-17 12:15:33 +00:00 |
|
hiyouga
|
142191e466
|
fix #6348
|
2024-12-17 10:06:46 +00:00 |
|
hiyouga
|
2811814fc4
|
fix mrope
|
2024-12-12 15:08:17 +00:00 |
|
hiyouga
|
207f8b069c
|
support qwen2vl vllm infer
|
2024-12-05 10:17:26 +00:00 |
|