Commit Graph

690 Commits

Author SHA1 Message Date
zhuHQ
763f9b9df0 [optim] add support to APOLLO (#6617)
Former-commit-id: d9189f9f0b
2025-01-15 00:24:56 +08:00
hoshi-hiyouga
91433d639c lint (#6641)
Former-commit-id: 1278c3e92e
2025-01-14 18:40:07 +08:00
Haian Huang(深度眸)
864ee06243 Support InternLM3 Dense 8B Model (#6640)
* support internlm3

* update

* update

* update

* add hint

Former-commit-id: deacc00b12
2025-01-14 18:07:27 +08:00
Xiaosu Zhu
a52496cc09 Fix tokenizer max length (#6632)
Former-commit-id: 58d029f321
2025-01-14 17:35:54 +08:00
Zhangchi Feng
ad119afc58 Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 (#6631)
* fix template name

* tiny fix

* support minicpm-o-2.6

* support inference of minicpmv

Former-commit-id: 158a127d34
2025-01-14 17:34:58 +08:00
hoshi-hiyouga
8f73c75c16 [model] fix mllama any image (#6637)
* fix mllama any image

* reorder classes

Former-commit-id: 98189c8e4d
2025-01-14 16:47:58 +08:00
hoshi-hiyouga
5e699458e5 pin vllm version to 0.6.5 (#6629)
Former-commit-id: 1c7663d304
2025-01-14 02:44:02 +08:00
Zhangchi Feng
201a495154 Support new features of MiniCPM-V (#6626)
* fix template name

* tiny fix

* support minicpm-o-2.6

Former-commit-id: c3fda5046d
2025-01-14 00:26:19 +08:00
hoshi-hiyouga
d8cba9464f [inference] fix stop token for object detection (#6624)
* fix stop token

* update minicpm data pipeline

* fix npu qlora examples

Former-commit-id: e3e2c8c689
2025-01-13 21:34:20 +08:00
codingma
089c7d5e51 add nf4 qlora support on Ascend NPU (#6601)
* add nf4 qlora support on Ascend NPU

* add transformers version check

* add python>=3.10 requirement description for npu

* tiny fix

---------

Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 03de5ac912
2025-01-13 19:43:36 +08:00
Zhangchi Feng
15bba15725 Fix template name of MiniCPM-V (#6620)
* fix template name

* tiny fix

Former-commit-id: 3077f20339
2025-01-13 16:46:48 +08:00
fzc8578
313ce9a576 remove tests
Former-commit-id: a019cece80
2025-01-13 15:08:35 +08:00
fzc8578
4741eec2d1 fix style
Former-commit-id: 0cc7260a93
2025-01-13 14:19:38 +08:00
fzc8578
d2afe0c63c fix system prompt and tests
Former-commit-id: cfaa8e4890
2025-01-13 14:18:06 +08:00
fzc8578
bdded9d41a add some
Former-commit-id: 01e9cfd406
2025-01-11 15:03:20 +08:00
fzc8578
e7f928adc4 fix format
Former-commit-id: 7b44f3127e
2025-01-11 01:27:40 +08:00
fzc8578
62c12a133e add some
Former-commit-id: a650e114e9
2025-01-11 01:10:24 +08:00
fzc8578
08e8499a98 adapt to new mllm_param
Former-commit-id: 291384dea8
2025-01-11 00:16:34 +08:00
Zhangchi Feng
d5b18ee4a6 Merge branch 'main' into minicpmv
Former-commit-id: ed0895a9c1
2025-01-11 00:01:36 +08:00
hiyouga
c89d17ab63 refactor mllm param logic
Former-commit-id: f6f630a1c9
2025-01-10 15:45:48 +00:00
fzc8578
0fb50f9c88 add some
Former-commit-id: 771cc80294
2025-01-10 23:29:06 +08:00
fzc8578
bcbe37ff52 add some
Former-commit-id: ae1f528df3
2025-01-10 21:25:32 +08:00
fzc8578
994049380d fix some
Former-commit-id: 15bbcdf8d3
2025-01-10 20:55:52 +08:00
fzc8578
7138b43873 fix some
Former-commit-id: 2ee8ba2f39
2025-01-10 20:27:06 +08:00
fzc8578
aeb4f82ef2 tiny fix
Former-commit-id: 84026be06e
2025-01-10 20:15:39 +08:00
Zhangchi Feng
f51ac40f0a Merge branch 'main' into minicpmv
Former-commit-id: fc045d7dd8
2025-01-10 20:12:07 +08:00
fzc8578
165fe8e219 add some
Former-commit-id: 096a6cb67a
2025-01-10 20:01:22 +08:00
hiyouga
b471def13d improve template, add phi4 model
Former-commit-id: ae16ea755d
2025-01-09 18:27:54 +00:00
hoshi-hiyouga
b777fed171 Merge pull request #6564 from stephen-nju/fix_ray
Fix ray

Former-commit-id: 6b34b69fa6
2025-01-08 18:14:18 +08:00
zhubin
014a7ea042 fix –get ray args when args not a dict
Former-commit-id: 9c4c84828b
2025-01-08 10:06:02 +00:00
hiyouga
da542fad18 imporve log
Former-commit-id: 47e17dd689
2025-01-08 09:56:10 +00:00
hiyouga
0c1ad5f3fb fix llamaboard with ray
Former-commit-id: c46675d5e5
2025-01-07 09:59:24 +00:00
hiyouga
b4174021d6 refactor ray integration, support save ckpt
Former-commit-id: d8cac6f546
2025-01-07 09:39:10 +00:00
Eric Tang
bba52e258e run style check
Former-commit-id: 1e8e7be0a5
2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
1217240918 drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

Former-commit-id: 163ddb680b
2025-01-07 08:55:44 +00:00
hiyouga
8c57169eb7 fix #6546
Former-commit-id: 870f23d7ea
2025-01-07 06:30:44 +00:00
fzc8578
b9eeaa9706 add some
Former-commit-id: 785cc70ff2
2025-01-06 19:32:39 +08:00
Zhangchi Feng
a0188a430f Merge branch 'hiyouga:main' into minicpmv
Former-commit-id: ab87bd6b13
2025-01-04 11:20:33 +08:00
fzc8578
b5ef5059ee add some
Former-commit-id: 79c2d7090c
2025-01-04 11:11:15 +08:00
hiyouga
528fb4f799 update model name
Former-commit-id: 4b8add7287
2025-01-02 12:19:21 +00:00
hiyouga
37c60c7d14 add gpt2 model
Former-commit-id: 67442bd497
2025-01-02 12:07:38 +00:00
hiyouga
da8721a70e fix #6499
Former-commit-id: 1800f8c72d
2025-01-02 11:28:54 +00:00
hiyouga
d0e729cd33 add deepseek3 model
Former-commit-id: e67b9dcc3a
2024-12-30 13:39:20 +00:00
hoshi-hiyouga
1178cb0e33 Merge pull request #5507 from piamo/main
Add deepseek-v2.5 template

Former-commit-id: 91467ed313
2024-12-30 21:08:25 +08:00
hiyouga
813f5919a3 fix #6482
Former-commit-id: 6f5bb3b8e5
2024-12-30 06:03:07 +00:00
hiyouga
3bcb4633ca fix #6448
Former-commit-id: 2719867982
2024-12-27 16:54:39 +00:00
youkaichao
f6d5dd6f10 Update cli.py
Former-commit-id: c39d81cd1d
2024-12-26 23:22:09 +08:00
hiyouga
c83b74ab9e add qvq #6439
Former-commit-id: ee0e400f41
2024-12-25 07:52:41 +00:00
hiyouga
353259f03f update readme
Former-commit-id: 8fd38d273e
2024-12-23 14:08:59 +00:00
hoshi-hiyouga
8265d6a228 Merge pull request #5922 from Tuyohai/main
support granite3 models

Former-commit-id: c23a4d0658
2024-12-23 16:46:02 +08:00