hoshi-hiyouga
9ef85f8fc4
[optim] clean apollo ( #6645 )
...
* clean apollo code
* update readme
Former-commit-id: 7a04021d0461caea2c7b82169839340b7f51f463
2025-01-15 01:42:50 +08:00
zhuHQ
763f9b9df0
[optim] add support to APOLLO ( #6617 )
...
Former-commit-id: d9189f9f0b23ff6929044919208e0e813ca95b1c
2025-01-15 00:24:56 +08:00
Zhangchi Feng
57043fb4e6
update readme of MiniCPM-o ( #6642 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
Former-commit-id: 9b7ba093c7e017ea18a4562550d5d2e82c4a0161
2025-01-14 21:22:35 +08:00
hoshi-hiyouga
91433d639c
lint ( #6641 )
...
Former-commit-id: 1278c3e92eeb297e883aab89e2384c1df1d0e910
2025-01-14 18:40:07 +08:00
Haian Huang(深度眸)
864ee06243
Support InternLM3 Dense 8B Model ( #6640 )
...
* support internlm3
* update
* update
* update
* add hint
Former-commit-id: deacc00b1226ca3d53bf7bb1231cf276eaa8296b
2025-01-14 18:07:27 +08:00
Xiaosu Zhu
a52496cc09
Fix tokenizer max length ( #6632 )
...
Former-commit-id: 58d029f3212dba1808e63cc8875022f6d741bd63
2025-01-14 17:35:54 +08:00
Zhangchi Feng
ad119afc58
Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 ( #6631 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
Former-commit-id: 158a127d340d5e4ca23263ffad042f861fd77deb
2025-01-14 17:34:58 +08:00
hoshi-hiyouga
8f73c75c16
[model] fix mllama any image ( #6637 )
...
* fix mllama any image
* reorder classes
Former-commit-id: 98189c8e4d70bf5f8ee83852a023ed27dfc96900
2025-01-14 16:47:58 +08:00
hoshi-hiyouga
5e699458e5
pin vllm version to 0.6.5 ( #6629 )
...
Former-commit-id: 1c7663d3049e00a9148c3e3c58204deca7a08c8d
2025-01-14 02:44:02 +08:00
Zhangchi Feng
201a495154
Support new features of MiniCPM-V ( #6626 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
Former-commit-id: c3fda5046d835ba4542d525b8d89cd12838e9f4c
2025-01-14 00:26:19 +08:00
hoshi-hiyouga
d8cba9464f
[inference] fix stop token for object detection ( #6624 )
...
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689c54ebb2af264de808502e5a8ba0f2b
2025-01-13 21:34:20 +08:00
codingma
089c7d5e51
add nf4 qlora support on Ascend NPU ( #6601 )
...
* add nf4 qlora support on Ascend NPU
* add transformers version check
* add python>=3.10 requirement description for npu
* tiny fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 03de5ac912336190d6b3583f70b6340ab9cf9cdf
2025-01-13 19:43:36 +08:00
Zhangchi Feng
15bba15725
Fix template name of MiniCPM-V ( #6620 )
...
* fix template name
* tiny fix
Former-commit-id: 3077f20339158564009270edf79c8ef1b10e8b4a
2025-01-13 16:46:48 +08:00
hoshi-hiyouga
0b47c2a293
Merge pull request #6598 from BUAADreamer/minicpmv
...
[model] Support MiniCPM-V
Former-commit-id: 6eec50c74dcbcc325ad6258228e19c19b4a03538
2025-01-13 15:24:02 +08:00
fzc8578
313ce9a576
remove tests
...
Former-commit-id: a019cece8009b0ba8a6b5a309ed5abfe6cb88a75
2025-01-13 15:08:35 +08:00
fzc8578
ee87d318b8
fix tests
...
Former-commit-id: c2fa4cc7b114ac1a376882022e4b6ef75d288dca
2025-01-13 15:01:39 +08:00
fzc8578
4741eec2d1
fix style
...
Former-commit-id: 0cc7260a93bf7c65451e376245aa143f9237d7d8
2025-01-13 14:19:38 +08:00
fzc8578
d2afe0c63c
fix system prompt and tests
...
Former-commit-id: cfaa8e4890ad99ec1fb90d9550503d734b5c30b7
2025-01-13 14:18:06 +08:00
fzc8578
bdded9d41a
add some
...
Former-commit-id: 01e9cfd406dc21f387b4f2baa1d61195a841ccb5
2025-01-11 15:03:20 +08:00
fzc8578
8c79fe6a5a
add cpm_o test
...
Former-commit-id: 10073319b4215be900744a28a61bd442e70143cc
2025-01-11 11:55:30 +08:00
fzc8578
63bb2b7235
add cpm_o test
...
Former-commit-id: c506f763dff1c1d2c85ac8fe6beb9f40ca4fcde9
2025-01-11 11:49:03 +08:00
fzc8578
e7f928adc4
fix format
...
Former-commit-id: 7b44f3127ef7e91a6bedca0311feb14974914ddf
2025-01-11 01:27:40 +08:00
fzc8578
62c12a133e
add some
...
Former-commit-id: a650e114e907278ece188922467c2514de544eeb
2025-01-11 01:10:24 +08:00
fzc8578
08e8499a98
adapt to new mllm_param
...
Former-commit-id: 291384dea8a5c10f0358a30d124eaf85557548eb
2025-01-11 00:16:34 +08:00
Zhangchi Feng
d5b18ee4a6
Merge branch 'main' into minicpmv
...
Former-commit-id: ed0895a9c13b0ea8a5cace6b060f01d9771816ad
2025-01-11 00:01:36 +08:00
hoshi-hiyouga
93cc1f167b
Merge pull request #6600 from hiyouga/hiyouga/refactor_mllm_param
...
[model] refactor mllm param logic
Former-commit-id: 382e932228d1bcfcdee0a25ee3f1977226f1c433
2025-01-10 23:53:37 +08:00
hiyouga
c89d17ab63
refactor mllm param logic
...
Former-commit-id: f6f630a1c96514053176abb12e35a06242e62abd
2025-01-10 15:45:48 +00:00
fzc8578
9213e48fa2
add minicpmv2.6
...
Former-commit-id: e45329e7456b647d5684b1f9428641ad18af92d1
2025-01-10 23:45:44 +08:00
fzc8578
0fb50f9c88
add some
...
Former-commit-id: 771cc802941cf1953b32e5102c817c6a3090b5ce
2025-01-10 23:29:06 +08:00
fzc8578
bcbe37ff52
add some
...
Former-commit-id: ae1f528df31194fe37a123ba1e5a4cd263a61602
2025-01-10 21:25:32 +08:00
fzc8578
994049380d
fix some
...
Former-commit-id: 15bbcdf8d3265f4154d3937719da5e54a5963355
2025-01-10 20:55:52 +08:00
fzc8578
cc6a6f698f
fix version
...
Former-commit-id: d09032049c1f24336a1899908bf47a98e77b3211
2025-01-10 20:31:04 +08:00
fzc8578
7138b43873
fix some
...
Former-commit-id: 2ee8ba2f390551af1b865cfa813f5c8b7bbb41c5
2025-01-10 20:27:06 +08:00
fzc8578
aeb4f82ef2
tiny fix
...
Former-commit-id: 84026be06e34239a828a0cc8b1706084afcfa4ea
2025-01-10 20:15:39 +08:00
Zhangchi Feng
f51ac40f0a
Merge branch 'main' into minicpmv
...
Former-commit-id: fc045d7dd871985d621430b5662cba882188a59c
2025-01-10 20:12:07 +08:00
fzc8578
165fe8e219
add some
...
Former-commit-id: 096a6cb67a7dfd14a6e339d96baab78c12d36a87
2025-01-10 20:01:22 +08:00
hoshi-hiyouga
4243c618f0
Merge pull request #6597 from hiyouga/hiyouga/upd_wechat
...
[assets] update wechat
Former-commit-id: b308ddf0971606f0f8f39e26f5711852abad3e79
2025-01-10 18:41:47 +08:00
hiyouga
368d22f79a
update wechat
...
Former-commit-id: 70ed03b288c1853f262e47b06e8601eaf49ccc1b
2025-01-10 10:40:25 +00:00
hoshi-hiyouga
b3561ae552
Merge pull request #6588 from hiyouga/hiyouga/upd_issue_temp
...
[gh] update issue template
Former-commit-id: 5ffd8ad192bb3932fbe230757d4bf1c907ca3aa4
2025-01-10 03:03:48 +08:00
hiyouga
b395540826
update issue template
...
Former-commit-id: aa8d0a223b0345e1f665b6703678c0ce526ff950
2025-01-09 18:58:53 +00:00
hoshi-hiyouga
a1b5644889
Merge pull request #6585 from hiyouga/hiyouga/add_phi4
...
[model] add phi4 model
Former-commit-id: 8b209cb49d9cc6058ce61c97bf2216f6371c5f7c
2025-01-10 02:39:17 +08:00
hiyouga
b471def13d
improve template, add phi4 model
...
Former-commit-id: ae16ea755d581a5a288fb55f12481215f369b255
2025-01-09 18:27:54 +00:00
hoshi-hiyouga
b777fed171
Merge pull request #6564 from stephen-nju/fix_ray
...
Fix ray
Former-commit-id: 6b34b69fa688c4622489d3d5f33d847fb6b95528
2025-01-08 18:14:18 +08:00
hoshi-hiyouga
618ceda6e9
Merge pull request #6565 from hiyouga/hiyouga/improve_log
...
[misc] improve log
Former-commit-id: 18431527bac8da57d9a2fc014695e5891f7a3068
2025-01-08 18:08:21 +08:00
zhubin
014a7ea042
fix get ray args when args not a dict
...
Former-commit-id: 9c4c84828b77acf48caf60726e4e7ef3e972118d
2025-01-08 10:06:02 +00:00
hiyouga
da542fad18
improve log
...
Former-commit-id: 47e17dd689840ca9b3c5f34448e5f80265336cca
2025-01-08 09:56:10 +00:00
hoshi-hiyouga
984b202f83
Merge pull request #6542 from erictang000/et/ray-integration
...
Ray Train integration with LLaMA-Factory
Former-commit-id: d23a98825bcb569bc51e21a3c2236eccd2f6d2fd
2025-01-08 11:46:03 +08:00
hiyouga
0c1ad5f3fb
fix llamaboard with ray
...
Former-commit-id: c46675d5e56d175c27d705ef0068fb47dc89a872
2025-01-07 09:59:24 +00:00
hiyouga
b4174021d6
refactor ray integration, support save ckpt
...
Former-commit-id: d8cac6f54663e6cffeddf2c65e3da454e7b86a75
2025-01-07 09:39:10 +00:00
Eric Tang
bba52e258e
run style check
...
Former-commit-id: 1e8e7be0a535e55888f58bbe2c38bc1c382e9012
2025-01-07 08:55:44 +00:00