Xiaosu Zhu
|
a52496cc09
|
Fix tokenizer max length (#6632)
Former-commit-id: 58d029f3212dba1808e63cc8875022f6d741bd63
|
2025-01-14 17:35:54 +08:00 |
|
Zhangchi Feng
|
ad119afc58
|
Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 (#6631)
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
Former-commit-id: 158a127d340d5e4ca23263ffad042f861fd77deb
|
2025-01-14 17:34:58 +08:00 |
|
hoshi-hiyouga
|
8f73c75c16
|
[model] fix mllama any image (#6637)
* fix mllama any image
* reorder classes
Former-commit-id: 98189c8e4d70bf5f8ee83852a023ed27dfc96900
|
2025-01-14 16:47:58 +08:00 |
|
hoshi-hiyouga
|
5e699458e5
|
pin vllm version to 0.6.5 (#6629)
Former-commit-id: 1c7663d3049e00a9148c3e3c58204deca7a08c8d
|
2025-01-14 02:44:02 +08:00 |
|
Zhangchi Feng
|
201a495154
|
Support new features of MiniCPM-V (#6626)
* fix template name
* tiny fix
* support minicpm-o-2.6
Former-commit-id: c3fda5046d835ba4542d525b8d89cd12838e9f4c
|
2025-01-14 00:26:19 +08:00 |
|
hoshi-hiyouga
|
d8cba9464f
|
[inference] fix stop token for object detection (#6624)
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689c54ebb2af264de808502e5a8ba0f2b
|
2025-01-13 21:34:20 +08:00 |
|
codingma
|
089c7d5e51
|
add nf4 qlora support on Ascend NPU (#6601)
* add nf4 qlora support on Ascend NPU
* add transformers version check
* add python>=3.10 requirement description for npu
* tiny fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 03de5ac912336190d6b3583f70b6340ab9cf9cdf
|
2025-01-13 19:43:36 +08:00 |
|
Zhangchi Feng
|
15bba15725
|
Fix template name of MiniCPM-V (#6620)
* fix template name
* tiny fix
Former-commit-id: 3077f20339158564009270edf79c8ef1b10e8b4a
|
2025-01-13 16:46:48 +08:00 |
|
fzc8578
|
313ce9a576
|
remove tests
Former-commit-id: a019cece8009b0ba8a6b5a309ed5abfe6cb88a75
|
2025-01-13 15:08:35 +08:00 |
|
fzc8578
|
4741eec2d1
|
fix style
Former-commit-id: 0cc7260a93bf7c65451e376245aa143f9237d7d8
|
2025-01-13 14:19:38 +08:00 |
|
fzc8578
|
d2afe0c63c
|
fix system prompt and tests
Former-commit-id: cfaa8e4890ad99ec1fb90d9550503d734b5c30b7
|
2025-01-13 14:18:06 +08:00 |
|
fzc8578
|
bdded9d41a
|
add some
Former-commit-id: 01e9cfd406dc21f387b4f2baa1d61195a841ccb5
|
2025-01-11 15:03:20 +08:00 |
|
fzc8578
|
e7f928adc4
|
fix format
Former-commit-id: 7b44f3127ef7e91a6bedca0311feb14974914ddf
|
2025-01-11 01:27:40 +08:00 |
|
fzc8578
|
62c12a133e
|
add some
Former-commit-id: a650e114e907278ece188922467c2514de544eeb
|
2025-01-11 01:10:24 +08:00 |
|
fzc8578
|
08e8499a98
|
adapt to new mllm_param
Former-commit-id: 291384dea8a5c10f0358a30d124eaf85557548eb
|
2025-01-11 00:16:34 +08:00 |
|
Zhangchi Feng
|
d5b18ee4a6
|
Merge branch 'main' into minicpmv
Former-commit-id: ed0895a9c13b0ea8a5cace6b060f01d9771816ad
|
2025-01-11 00:01:36 +08:00 |
|
hiyouga
|
c89d17ab63
|
refactor mllm param logic
Former-commit-id: f6f630a1c96514053176abb12e35a06242e62abd
|
2025-01-10 15:45:48 +00:00 |
|
fzc8578
|
0fb50f9c88
|
add some
Former-commit-id: 771cc802941cf1953b32e5102c817c6a3090b5ce
|
2025-01-10 23:29:06 +08:00 |
|
fzc8578
|
bcbe37ff52
|
add some
Former-commit-id: ae1f528df31194fe37a123ba1e5a4cd263a61602
|
2025-01-10 21:25:32 +08:00 |
|
fzc8578
|
994049380d
|
fix some
Former-commit-id: 15bbcdf8d3265f4154d3937719da5e54a5963355
|
2025-01-10 20:55:52 +08:00 |
|
fzc8578
|
7138b43873
|
fix some
Former-commit-id: 2ee8ba2f390551af1b865cfa813f5c8b7bbb41c5
|
2025-01-10 20:27:06 +08:00 |
|
fzc8578
|
aeb4f82ef2
|
tiny fix
Former-commit-id: 84026be06e34239a828a0cc8b1706084afcfa4ea
|
2025-01-10 20:15:39 +08:00 |
|
Zhangchi Feng
|
f51ac40f0a
|
Merge branch 'main' into minicpmv
Former-commit-id: fc045d7dd871985d621430b5662cba882188a59c
|
2025-01-10 20:12:07 +08:00 |
|
fzc8578
|
165fe8e219
|
add some
Former-commit-id: 096a6cb67a7dfd14a6e339d96baab78c12d36a87
|
2025-01-10 20:01:22 +08:00 |
|
hiyouga
|
b471def13d
|
improve template, add phi4 model
Former-commit-id: ae16ea755d581a5a288fb55f12481215f369b255
|
2025-01-09 18:27:54 +00:00 |
|
hoshi-hiyouga
|
b777fed171
|
Merge pull request #6564 from stephen-nju/fix_ray
Fix ray
Former-commit-id: 6b34b69fa688c4622489d3d5f33d847fb6b95528
|
2025-01-08 18:14:18 +08:00 |
|
zhubin
|
014a7ea042
|
fix get ray args when args not a dict
Former-commit-id: 9c4c84828b77acf48caf60726e4e7ef3e972118d
|
2025-01-08 10:06:02 +00:00 |
|
hiyouga
|
da542fad18
|
imporve log
Former-commit-id: 47e17dd689840ca9b3c5f34448e5f80265336cca
|
2025-01-08 09:56:10 +00:00 |
|
hiyouga
|
0c1ad5f3fb
|
fix llamaboard with ray
Former-commit-id: c46675d5e56d175c27d705ef0068fb47dc89a872
|
2025-01-07 09:59:24 +00:00 |
|
hiyouga
|
b4174021d6
|
refactor ray integration, support save ckpt
Former-commit-id: d8cac6f54663e6cffeddf2c65e3da454e7b86a75
|
2025-01-07 09:39:10 +00:00 |
|
Eric Tang
|
bba52e258e
|
run style check
Former-commit-id: 1e8e7be0a535e55888f58bbe2c38bc1c382e9012
|
2025-01-07 08:55:44 +00:00 |
|
Kourosh Hakhamaneshi
|
1217240918
|
drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
Former-commit-id: 163ddb680b6f84a4424a887a3b8a5d668044e87c
|
2025-01-07 08:55:44 +00:00 |
|
hiyouga
|
8c57169eb7
|
fix #6546
Former-commit-id: 870f23d7eaff1e32a73fee4eb972163c85ba7b67
|
2025-01-07 06:30:44 +00:00 |
|
fzc8578
|
b9eeaa9706
|
add some
Former-commit-id: 785cc70ff205f5962c3ca67f453589e4a471ba8c
|
2025-01-06 19:32:39 +08:00 |
|
Zhangchi Feng
|
a0188a430f
|
Merge branch 'hiyouga:main' into minicpmv
Former-commit-id: ab87bd6b1398b379b1a7a95f01a6539743b9db2d
|
2025-01-04 11:20:33 +08:00 |
|
fzc8578
|
b5ef5059ee
|
add some
Former-commit-id: 79c2d7090cbf364063ea3608814ab18aa27fdc87
|
2025-01-04 11:11:15 +08:00 |
|
hiyouga
|
528fb4f799
|
update model name
Former-commit-id: 4b8add728729d8e2ce4c9a3dc6748357291d8e8b
|
2025-01-02 12:19:21 +00:00 |
|
hiyouga
|
37c60c7d14
|
add gpt2 model
Former-commit-id: 67442bd497c75b0c5990d94a880e0e25474ae2fa
|
2025-01-02 12:07:38 +00:00 |
|
hiyouga
|
da8721a70e
|
fix #6499
Former-commit-id: 1800f8c72dfa618c71c84a3a18ecdef4d82754f7
|
2025-01-02 11:28:54 +00:00 |
|
hiyouga
|
d0e729cd33
|
add deepseek3 model
Former-commit-id: e67b9dcc3ad0c003bc3afd7601ecd2adfbf9666b
|
2024-12-30 13:39:20 +00:00 |
|
hoshi-hiyouga
|
1178cb0e33
|
Merge pull request #5507 from piamo/main
Add deepseek-v2.5 template
Former-commit-id: 91467ed313802ac3950c2e11a7d0997a36bcbddd
|
2024-12-30 21:08:25 +08:00 |
|
hiyouga
|
813f5919a3
|
fix #6482
Former-commit-id: 6f5bb3b8e5b6eb7fdfd7b0ca8eba789ab741a7b6
|
2024-12-30 06:03:07 +00:00 |
|
hiyouga
|
3bcb4633ca
|
fix #6448
Former-commit-id: 27198679829fb766c7eef468ae4311fdced695a2
|
2024-12-27 16:54:39 +00:00 |
|
youkaichao
|
f6d5dd6f10
|
Update cli.py
Former-commit-id: c39d81cd1d108d832746e100ac890b2d4ecaa60e
|
2024-12-26 23:22:09 +08:00 |
|
hiyouga
|
c83b74ab9e
|
add qvq #6439
Former-commit-id: ee0e400f417f648cd15cf48144df76e4809cc615
|
2024-12-25 07:52:41 +00:00 |
|
hiyouga
|
353259f03f
|
update readme
Former-commit-id: 8fd38d273e5bc3b28a4741b230010fece87e7070
|
2024-12-23 14:08:59 +00:00 |
|
hoshi-hiyouga
|
8265d6a228
|
Merge pull request #5922 from Tuyohai/main
support granite3 models
Former-commit-id: c23a4d0658323434c386716c25855711202e37a9
|
2024-12-23 16:46:02 +08:00 |
|
hiyouga
|
47c2d91933
|
support report custom args
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
|
2024-12-21 21:42:45 +00:00 |
|
hiyouga
|
f07bad7144
|
fix paligemma infer
Former-commit-id: 84cd1188ac03c165e1a626db297936c2458627d6
|
2024-12-21 20:24:32 +00:00 |
|
hoshi-hiyouga
|
547f76e56e
|
Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
Former-commit-id: 947e22a4a30d8eb7b612da53bbf538ead7dd27b7
|
2024-12-21 14:09:33 +08:00 |
|