hoshi-hiyouga
|
d0da6f40b0
|
[model] fix mllama any image (#6637)
* fix mllama any image
* reorder classes
Former-commit-id: 1242a1c4b4a465c06363fdc59302e80e5c4c96e6
|
2025-01-14 16:47:58 +08:00 |
|
hoshi-hiyouga
|
28d145a066
|
pin vllm version to 0.6.5 (#6629)
Former-commit-id: 26097ca0adf25ebb7d9e8eec2d2cef673c6cfe88
|
2025-01-14 02:44:02 +08:00 |
|
Zhangchi Feng
|
ae32c148d1
|
Support new features of MiniCPM-V (#6626)
* fix template name
* tiny fix
* support minicpm-o-2.6
Former-commit-id: 53034a61c7654358f46916cbc370910fb2aeff3b
|
2025-01-14 00:26:19 +08:00 |
|
hoshi-hiyouga
|
2a05941b14
|
[inference] fix stop token for object detection (#6624)
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: 844919fadaa8a61dfae47020971ea80730b2346f
|
2025-01-13 21:34:20 +08:00 |
|
codingma
|
11c38b9173
|
add nf4 qlora support on Ascend NPU (#6601)
* add nf4 qlora support on Ascend NPU
* add transformers version check
* add python>=3.10 requirement description for npu
* tiny fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 7912d1acac5f10dab22145fe729a90c57aad8d85
|
2025-01-13 19:43:36 +08:00 |
|
Zhangchi Feng
|
73c1c15b62
|
Fix template name of MiniCPM-V (#6620)
* fix template name
* tiny fix
Former-commit-id: 94dea52cef709a7e6f1cdc0b78e83e0422bd65d3
|
2025-01-13 16:46:48 +08:00 |
|
fzc8578
|
ec552372ba
|
remove tests
Former-commit-id: 51addcd7ab81548a9952064dd8c95a8542252003
|
2025-01-13 15:08:35 +08:00 |
|
fzc8578
|
4b61610b12
|
fix style
Former-commit-id: 76a36d9acecbf36b6959a14caacfed1d32bcee41
|
2025-01-13 14:19:38 +08:00 |
|
fzc8578
|
07798e4aad
|
fix system prompt and tests
Former-commit-id: 955efca677b299749f3d40d587ee310951537543
|
2025-01-13 14:18:06 +08:00 |
|
fzc8578
|
6d6acd0213
|
add some
Former-commit-id: 5ad8ef3ec434f53f6fc494474becb034a3aca0ca
|
2025-01-11 15:03:20 +08:00 |
|
fzc8578
|
31bfdb08cd
|
fix format
Former-commit-id: 964e18be5a824950164bc7232d35822a8b116d1a
|
2025-01-11 01:27:40 +08:00 |
|
fzc8578
|
12c83e00fc
|
add some
Former-commit-id: 6233764d18f31365e9ba450408306fad55567ffc
|
2025-01-11 01:10:24 +08:00 |
|
fzc8578
|
9dc7b6c7ac
|
adapt to new mllm_param
Former-commit-id: 0775b71965863c2618c117726a1046a36d6d85b8
|
2025-01-11 00:16:34 +08:00 |
|
Zhangchi Feng
|
627548bf7f
|
Merge branch 'main' into minicpmv
Former-commit-id: 8a9c90759feda975faadc5858bd44b7ea116e7fb
|
2025-01-11 00:01:36 +08:00 |
|
hiyouga
|
dc65ecdf09
|
refactor mllm param logic
Former-commit-id: b895c190945cf5d991cb4e4dea2ae73cc9c8d246
|
2025-01-10 15:45:48 +00:00 |
|
fzc8578
|
1f3b729a4b
|
add some
Former-commit-id: 58f50b8729083e9ea0fdcf07042b06261670ad57
|
2025-01-10 23:29:06 +08:00 |
|
fzc8578
|
0aa7ac210f
|
add some
Former-commit-id: 3acd151a0f8efdd230c0b0980550795d204a69f7
|
2025-01-10 21:25:32 +08:00 |
|
fzc8578
|
40382f1387
|
fix some
Former-commit-id: 1eb7118db3ad6054cfd59d5f16a5d882e40e9057
|
2025-01-10 20:55:52 +08:00 |
|
fzc8578
|
e63c2df0b1
|
fix some
Former-commit-id: cd5a1a8b9c6eb59d6e95f79573f60ad8668f1942
|
2025-01-10 20:27:06 +08:00 |
|
fzc8578
|
25d4889789
|
tiny fix
Former-commit-id: f088e580d3bacd0eecd0c3bf17e928eb49832ba1
|
2025-01-10 20:15:39 +08:00 |
|
Zhangchi Feng
|
8c0a721c4c
|
Merge branch 'main' into minicpmv
Former-commit-id: d8840ae416660e23f1d615ffd404f519360151d9
|
2025-01-10 20:12:07 +08:00 |
|
fzc8578
|
9e972bc9ec
|
add some
Former-commit-id: fede563aeb716ba5d1e368fd3e1182e4e580d248
|
2025-01-10 20:01:22 +08:00 |
|
hiyouga
|
867980196e
|
improve template, add phi4 model
Former-commit-id: a785b6796e445a3adba45c5b6947166a2ff99871
|
2025-01-09 18:27:54 +00:00 |
|
hoshi-hiyouga
|
4e25d037c8
|
Merge pull request #6564 from stephen-nju/fix_ray
Fix ray
Former-commit-id: d4566839369726023f1b6e8f4b2332bda0c715cc
|
2025-01-08 18:14:18 +08:00 |
|
zhubin
|
b6b53b61f7
|
fix get ray args when args not a dict
Former-commit-id: 5e5398cd5b117b2378107172d3f91cfb0321e842
|
2025-01-08 10:06:02 +00:00 |
|
hiyouga
|
647c51a772
|
improve log
Former-commit-id: a6abf375975ffea3d51e1b944c9855b5f62ffac8
|
2025-01-08 09:56:10 +00:00 |
|
hiyouga
|
0ef1f981da
|
fix llamaboard with ray
Former-commit-id: bd8a432d6a980b1b24a551626304fe3d394b1baf
|
2025-01-07 09:59:24 +00:00 |
|
hiyouga
|
944a2aec4d
|
refactor ray integration, support save ckpt
Former-commit-id: 2f50b27e608b2092bfceab6c6e84e6631e973ee2
|
2025-01-07 09:39:10 +00:00 |
|
Eric Tang
|
4f31ad997c
|
run style check
Former-commit-id: 5ec33baf5f95df9fa2afe5523c825d3eda8a076b
|
2025-01-07 08:55:44 +00:00 |
|
Kourosh Hakhamaneshi
|
8683582300
|
drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
Former-commit-id: 19c12ddae9350f6e25a270fe3372f5b9094cf960
|
2025-01-07 08:55:44 +00:00 |
|
hiyouga
|
d8bd46f1bf
|
fix #6546
Former-commit-id: 6fcf2f10faf3b1614896b091591eeef96d717e64
|
2025-01-07 06:30:44 +00:00 |
|
fzc8578
|
8c2a712247
|
add some
Former-commit-id: b4790c66c126567bd193de52a564e3ce11c94769
|
2025-01-06 19:32:39 +08:00 |
|
Zhangchi Feng
|
08729dbefc
|
Merge branch 'hiyouga:main' into minicpmv
Former-commit-id: 873b2d5888038e2328a12a6eb7c84099ba7ca1f3
|
2025-01-04 11:20:33 +08:00 |
|
fzc8578
|
2c120aa0df
|
add some
Former-commit-id: 81176fe226da89eace89cb202bad68e73b7c2a02
|
2025-01-04 11:11:15 +08:00 |
|
hiyouga
|
8a5b4bdfd4
|
update model name
Former-commit-id: bf627d9f1ac117f040adbfd7630b5283f0db556a
|
2025-01-02 12:19:21 +00:00 |
|
hiyouga
|
18a1a4b9da
|
add gpt2 model
Former-commit-id: 37d5e3639fcf5ae6e58cc435e0fa9dee0d6e4ead
|
2025-01-02 12:07:38 +00:00 |
|
hiyouga
|
2aaf3697d7
|
fix #6499
Former-commit-id: dffc607220ff6dac15cf501ac9a3cdbe80c25211
|
2025-01-02 11:28:54 +00:00 |
|
hiyouga
|
b2e4f11602
|
add deepseek3 model
Former-commit-id: 611779d412f31e25b1ed38049050eee2da61dde5
|
2024-12-30 13:39:20 +00:00 |
|
hoshi-hiyouga
|
e3f95abca7
|
Merge pull request #5507 from piamo/main
Add deepseek-v2.5 template
Former-commit-id: 8a4911d201e219465fe0835a3ceb967f8b80dc0e
|
2024-12-30 21:08:25 +08:00 |
|
hiyouga
|
f8f05a883b
|
fix #6482
Former-commit-id: 8577f52b4152efe6cc7a8b5f6d37b4f9ba6684e7
|
2024-12-30 06:03:07 +00:00 |
|
hiyouga
|
88b1874c04
|
fix #6448
Former-commit-id: 04f78e85af5af14b4c195936623e426a6a128af2
|
2024-12-27 16:54:39 +00:00 |
|
youkaichao
|
552816e04b
|
Update cli.py
Former-commit-id: 18e65bbd3ae07af3b9eed7f293c345815776c325
|
2024-12-26 23:22:09 +08:00 |
|
hiyouga
|
3c55976a0e
|
add qvq #6439
Former-commit-id: 4dbfa142d899dd6e4d1a9d4db125765af5580a4f
|
2024-12-25 07:52:41 +00:00 |
|
hiyouga
|
a5346041bb
|
update readme
Former-commit-id: 1deda4750e0df6c46aeb33cf3f8b35baa537cc1d
|
2024-12-23 14:08:59 +00:00 |
|
hoshi-hiyouga
|
df42e438c1
|
Merge pull request #5922 from Tuyohai/main
support granite3 models
Former-commit-id: a9087bc0549f7f16e5b4c39e324043755b1618c8
|
2024-12-23 16:46:02 +08:00 |
|
hiyouga
|
a897d46049
|
support report custom args
Former-commit-id: d41254c40a1c5cacf9377096adb27efa9bdb79ea
|
2024-12-21 21:42:45 +00:00 |
|
hiyouga
|
adff887659
|
fix paligemma infer
Former-commit-id: d272455d6118c1d670c70cfe3458d8dab111da6c
|
2024-12-21 20:24:32 +00:00 |
|
hoshi-hiyouga
|
0a869c4ed4
|
Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
Former-commit-id: e65fe507f7643bf40b0fc462805c7b7f8ef6b738
|
2024-12-21 14:09:33 +08:00 |
|
ZeYi Lin
|
f792eaf8d4
|
fix: project blank
Former-commit-id: 3a0939572b0bfc7da0ee1a7244b6b3fbf567aba0
|
2024-12-20 18:26:02 +08:00 |
|
ZeYi Lin
|
8a41c96761
|
fix: by hiyouga suggestion
Former-commit-id: 41195f1bc69e4b5da7a265369d368b06754362cf
|
2024-12-20 16:43:03 +08:00 |
|