hoshi-hiyouga
|
320e40d873
|
update readme (#6648)
Former-commit-id: 563be2286a
|
2025-01-15 11:06:19 +08:00 |
|
hoshi-hiyouga
|
9ef85f8fc4
|
[optim] clean apollo (#6645)
* clean apollo code
* update readme
Former-commit-id: 7a04021d04
|
2025-01-15 01:42:50 +08:00 |
|
zhuHQ
|
763f9b9df0
|
[optim] add support to APOLLO (#6617)
Former-commit-id: d9189f9f0b
|
2025-01-15 00:24:56 +08:00 |
|
Zhangchi Feng
|
57043fb4e6
|
update readme of MiniCPM-o (#6642)
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
Former-commit-id: 9b7ba093c7
|
2025-01-14 21:22:35 +08:00 |
|
hoshi-hiyouga
|
91433d639c
|
lint (#6641)
Former-commit-id: 1278c3e92e
|
2025-01-14 18:40:07 +08:00 |
|
Haian Huang(深度眸)
|
864ee06243
|
Support InternLM3 Dense 8B Model (#6640)
* support internlm3
* update
* update
* update
* add hint
Former-commit-id: deacc00b12
|
2025-01-14 18:07:27 +08:00 |
|
Xiaosu Zhu
|
a52496cc09
|
Fix tokenizer max length (#6632)
Former-commit-id: 58d029f321
|
2025-01-14 17:35:54 +08:00 |
|
Zhangchi Feng
|
ad119afc58
|
Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 (#6631)
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
Former-commit-id: 158a127d34
|
2025-01-14 17:34:58 +08:00 |
|
hoshi-hiyouga
|
8f73c75c16
|
[model] fix mllama any image (#6637)
* fix mllama any image
* reorder classes
Former-commit-id: 98189c8e4d
|
2025-01-14 16:47:58 +08:00 |
|
hoshi-hiyouga
|
5e699458e5
|
pin vllm version to 0.6.5 (#6629)
Former-commit-id: 1c7663d304
|
2025-01-14 02:44:02 +08:00 |
|
Zhangchi Feng
|
201a495154
|
Support new features of MiniCPM-V (#6626)
* fix template name
* tiny fix
* support minicpm-o-2.6
Former-commit-id: c3fda5046d
|
2025-01-14 00:26:19 +08:00 |
|
hoshi-hiyouga
|
d8cba9464f
|
[inference] fix stop token for object detection (#6624)
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689
|
2025-01-13 21:34:20 +08:00 |
|
codingma
|
089c7d5e51
|
add nf4 qlora support on Ascend NPU (#6601)
* add nf4 qlora support on Ascend NPU
* add transformers version check
* add python>=3.10 requirement description for npu
* tiny fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 03de5ac912
|
2025-01-13 19:43:36 +08:00 |
|
Zhangchi Feng
|
15bba15725
|
Fix template name of MiniCPM-V (#6620)
* fix template name
* tiny fix
Former-commit-id: 3077f20339
|
2025-01-13 16:46:48 +08:00 |
|
hoshi-hiyouga
|
0b47c2a293
|
Merge pull request #6598 from BUAADreamer/minicpmv
[model] Support MiniCPM-V
Former-commit-id: 6eec50c74d
|
2025-01-13 15:24:02 +08:00 |
|
fzc8578
|
313ce9a576
|
remove tests
Former-commit-id: a019cece80
|
2025-01-13 15:08:35 +08:00 |
|
fzc8578
|
ee87d318b8
|
fix tests
Former-commit-id: c2fa4cc7b1
|
2025-01-13 15:01:39 +08:00 |
|
fzc8578
|
4741eec2d1
|
fix style
Former-commit-id: 0cc7260a93
|
2025-01-13 14:19:38 +08:00 |
|
fzc8578
|
d2afe0c63c
|
fix system prompt and tests
Former-commit-id: cfaa8e4890
|
2025-01-13 14:18:06 +08:00 |
|
fzc8578
|
bdded9d41a
|
add some
Former-commit-id: 01e9cfd406
|
2025-01-11 15:03:20 +08:00 |
|
fzc8578
|
8c79fe6a5a
|
add cpm_o test
Former-commit-id: 10073319b4
|
2025-01-11 11:55:30 +08:00 |
|
fzc8578
|
63bb2b7235
|
add cpm_o test
Former-commit-id: c506f763df
|
2025-01-11 11:49:03 +08:00 |
|
fzc8578
|
e7f928adc4
|
fix format
Former-commit-id: 7b44f3127e
|
2025-01-11 01:27:40 +08:00 |
|
fzc8578
|
62c12a133e
|
add some
Former-commit-id: a650e114e9
|
2025-01-11 01:10:24 +08:00 |
|
fzc8578
|
08e8499a98
|
adapt to new mllm_param
Former-commit-id: 291384dea8
|
2025-01-11 00:16:34 +08:00 |
|
Zhangchi Feng
|
d5b18ee4a6
|
Merge branch 'main' into minicpmv
Former-commit-id: ed0895a9c1
|
2025-01-11 00:01:36 +08:00 |
|
hoshi-hiyouga
|
93cc1f167b
|
Merge pull request #6600 from hiyouga/hiyouga/refactor_mllm_param
[model] refactor mllm param logic
Former-commit-id: 382e932228
|
2025-01-10 23:53:37 +08:00 |
|
hiyouga
|
c89d17ab63
|
refactor mllm param logic
Former-commit-id: f6f630a1c9
|
2025-01-10 15:45:48 +00:00 |
|
fzc8578
|
9213e48fa2
|
add minicpmv2.6
Former-commit-id: e45329e745
|
2025-01-10 23:45:44 +08:00 |
|
fzc8578
|
0fb50f9c88
|
add some
Former-commit-id: 771cc80294
|
2025-01-10 23:29:06 +08:00 |
|
fzc8578
|
bcbe37ff52
|
add some
Former-commit-id: ae1f528df3
|
2025-01-10 21:25:32 +08:00 |
|
fzc8578
|
994049380d
|
fix some
Former-commit-id: 15bbcdf8d3
|
2025-01-10 20:55:52 +08:00 |
|
fzc8578
|
cc6a6f698f
|
fix version
Former-commit-id: d09032049c
|
2025-01-10 20:31:04 +08:00 |
|
fzc8578
|
7138b43873
|
fix some
Former-commit-id: 2ee8ba2f39
|
2025-01-10 20:27:06 +08:00 |
|
fzc8578
|
aeb4f82ef2
|
tiny fix
Former-commit-id: 84026be06e
|
2025-01-10 20:15:39 +08:00 |
|
Zhangchi Feng
|
f51ac40f0a
|
Merge branch 'main' into minicpmv
Former-commit-id: fc045d7dd8
|
2025-01-10 20:12:07 +08:00 |
|
fzc8578
|
165fe8e219
|
add some
Former-commit-id: 096a6cb67a
|
2025-01-10 20:01:22 +08:00 |
|
hoshi-hiyouga
|
4243c618f0
|
Merge pull request #6597 from hiyouga/hiyouga/upd_wechat
[assets] update wechat
Former-commit-id: b308ddf097
|
2025-01-10 18:41:47 +08:00 |
|
hiyouga
|
368d22f79a
|
update wechat
Former-commit-id: 70ed03b288
|
2025-01-10 10:40:25 +00:00 |
|
hoshi-hiyouga
|
b3561ae552
|
Merge pull request #6588 from hiyouga/hiyouga/upd_issue_temp
[gh] update issue template
Former-commit-id: 5ffd8ad192
|
2025-01-10 03:03:48 +08:00 |
|
hiyouga
|
b395540826
|
update issue template
Former-commit-id: aa8d0a223b
|
2025-01-09 18:58:53 +00:00 |
|
hoshi-hiyouga
|
a1b5644889
|
Merge pull request #6585 from hiyouga/hiyouga/add_phi4
[model] add phi4 model
Former-commit-id: 8b209cb49d
|
2025-01-10 02:39:17 +08:00 |
|
hiyouga
|
b471def13d
|
improve template, add phi4 model
Former-commit-id: ae16ea755d
|
2025-01-09 18:27:54 +00:00 |
|
hoshi-hiyouga
|
b777fed171
|
Merge pull request #6564 from stephen-nju/fix_ray
Fix ray
Former-commit-id: 6b34b69fa6
|
2025-01-08 18:14:18 +08:00 |
|
hoshi-hiyouga
|
618ceda6e9
|
Merge pull request #6565 from hiyouga/hiyouga/improve_log
[misc] imporve log
Former-commit-id: 18431527ba
|
2025-01-08 18:08:21 +08:00 |
|
zhubin
|
014a7ea042
|
fix get ray args when args not a dict
Former-commit-id: 9c4c84828b
|
2025-01-08 10:06:02 +00:00 |
|
hiyouga
|
da542fad18
|
imporve log
Former-commit-id: 47e17dd689
|
2025-01-08 09:56:10 +00:00 |
|
hoshi-hiyouga
|
984b202f83
|
Merge pull request #6542 from erictang000/et/ray-integration
Ray Train integration with LLaMA-Factory
Former-commit-id: d23a98825b
|
2025-01-08 11:46:03 +08:00 |
|
hiyouga
|
0c1ad5f3fb
|
fix llamaboard with ray
Former-commit-id: c46675d5e5
|
2025-01-07 09:59:24 +00:00 |
|
hiyouga
|
b4174021d6
|
refactor ray integration, support save ckpt
Former-commit-id: d8cac6f546
|
2025-01-07 09:39:10 +00:00 |
|