yinpu
aa7c07caf0
fix: avoid redundant normalization in DPO's SFT loss calculation (#6722)
...
Former-commit-id: 0f45982bac
2025-01-21 13:38:02 +08:00
engchina
324f07613a
[webui] support ja (#6698)
...
* add support for japanese language
* add support for japanese language
---------
Co-authored-by: engchina <atjapan2015@gmail.com>
Former-commit-id: de9bc3fefa
2025-01-20 19:46:38 +08:00
hoshi-hiyouga
0c59483368
[assets] update wechat (#6710)
...
Former-commit-id: 3962645ac0
2025-01-20 16:29:24 +08:00
hoshi-hiyouga
1efe525df7
[model] support yarn (#6693)
...
Former-commit-id: 1f47b6186c
2025-01-18 13:56:09 +08:00
hoshi-hiyouga
ee0b3b1e1a
[assets] update wechat (#6692)
...
Former-commit-id: 17b470630d
2025-01-18 12:35:03 +08:00
hoshi-hiyouga
f87c788154
[misc] update mm plugin (#6691)
...
Former-commit-id: c0caa7afc6
2025-01-17 23:04:26 +08:00
hoshi-hiyouga
bbf334f823
disable valset by default (#6690)
...
Former-commit-id: 77bbf65905
2025-01-17 21:09:30 +08:00
hoshi-hiyouga
770433fa33
[webui] upgrade to gradio 5 (#6688)
...
Former-commit-id: 4d0f662dbe
2025-01-17 20:15:42 +08:00
hoshi-hiyouga
788accb601
fix qwen2 moe (#6684)
...
Former-commit-id: 7bf09abf1c
2025-01-17 13:46:09 +08:00
Zhangchi Feng
555f17c1ee
[data] Fix minicpmv/o dpo training (#6657)
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
Former-commit-id: 027942789b
2025-01-15 17:30:37 +08:00
steveepreston
8895cf1152
Update val_size english description (#6653)
...
* Update `val_size` Description in locales.py
* Update `val_size` Description in data_args.py
* Remove extra space in data_args.py
Former-commit-id: 76675b654e
2025-01-15 16:00:20 +08:00
hoshi-hiyouga
320e40d873
update readme (#6648)
...
Former-commit-id: 563be2286a
2025-01-15 11:06:19 +08:00
hoshi-hiyouga
9ef85f8fc4
[optim] clean apollo (#6645)
...
* clean apollo code
* update readme
Former-commit-id: 7a04021d04
2025-01-15 01:42:50 +08:00
zhuHQ
763f9b9df0
[optim] add support to APOLLO (#6617)
...
Former-commit-id: d9189f9f0b
2025-01-15 00:24:56 +08:00
Zhangchi Feng
57043fb4e6
update readme of MiniCPM-o (#6642)
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
Former-commit-id: 9b7ba093c7
2025-01-14 21:22:35 +08:00
hoshi-hiyouga
91433d639c
lint (#6641)
...
Former-commit-id: 1278c3e92e
2025-01-14 18:40:07 +08:00
Haian Huang(深度眸)
864ee06243
Support InternLM3 Dense 8B Model (#6640)
...
* support internlm3
* update
* update
* update
* add hint
Former-commit-id: deacc00b12
2025-01-14 18:07:27 +08:00
Xiaosu Zhu
a52496cc09
Fix tokenizer max length (#6632)
...
Former-commit-id: 58d029f321
2025-01-14 17:35:54 +08:00
Zhangchi Feng
ad119afc58
Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 (#6631)
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
Former-commit-id: 158a127d34
2025-01-14 17:34:58 +08:00
hoshi-hiyouga
8f73c75c16
[model] fix mllama any image (#6637)
...
* fix mllama any image
* reorder classes
Former-commit-id: 98189c8e4d
2025-01-14 16:47:58 +08:00
hoshi-hiyouga
5e699458e5
pin vllm version to 0.6.5 (#6629)
...
Former-commit-id: 1c7663d304
2025-01-14 02:44:02 +08:00
Zhangchi Feng
201a495154
Support new features of MiniCPM-V (#6626)
...
* fix template name
* tiny fix
* support minicpm-o-2.6
Former-commit-id: c3fda5046d
2025-01-14 00:26:19 +08:00
hoshi-hiyouga
d8cba9464f
[inference] fix stop token for object detection (#6624)
...
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689
2025-01-13 21:34:20 +08:00
codingma
089c7d5e51
add nf4 qlora support on Ascend NPU (#6601)
...
* add nf4 qlora support on Ascend NPU
* add transformers version check
* add python>=3.10 requirement description for npu
* tiny fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 03de5ac912
2025-01-13 19:43:36 +08:00
Zhangchi Feng
15bba15725
Fix template name of MiniCPM-V (#6620)
...
* fix template name
* tiny fix
Former-commit-id: 3077f20339
2025-01-13 16:46:48 +08:00
hoshi-hiyouga
0b47c2a293
Merge pull request #6598 from BUAADreamer/minicpmv
...
[model] Support MiniCPM-V
Former-commit-id: 6eec50c74d
2025-01-13 15:24:02 +08:00
fzc8578
313ce9a576
remove tests
...
Former-commit-id: a019cece80
2025-01-13 15:08:35 +08:00
fzc8578
ee87d318b8
fix tests
...
Former-commit-id: c2fa4cc7b1
2025-01-13 15:01:39 +08:00
fzc8578
4741eec2d1
fix style
...
Former-commit-id: 0cc7260a93
2025-01-13 14:19:38 +08:00
fzc8578
d2afe0c63c
fix system prompt and tests
...
Former-commit-id: cfaa8e4890
2025-01-13 14:18:06 +08:00
fzc8578
bdded9d41a
add some
...
Former-commit-id: 01e9cfd406
2025-01-11 15:03:20 +08:00
fzc8578
8c79fe6a5a
add cpm_o test
...
Former-commit-id: 10073319b4
2025-01-11 11:55:30 +08:00
fzc8578
63bb2b7235
add cpm_o test
...
Former-commit-id: c506f763df
2025-01-11 11:49:03 +08:00
fzc8578
e7f928adc4
fix format
...
Former-commit-id: 7b44f3127e
2025-01-11 01:27:40 +08:00
fzc8578
62c12a133e
add some
...
Former-commit-id: a650e114e9
2025-01-11 01:10:24 +08:00
fzc8578
08e8499a98
adapt to new mllm_param
...
Former-commit-id: 291384dea8
2025-01-11 00:16:34 +08:00
Zhangchi Feng
d5b18ee4a6
Merge branch 'main' into minicpmv
...
Former-commit-id: ed0895a9c1
2025-01-11 00:01:36 +08:00
hoshi-hiyouga
93cc1f167b
Merge pull request #6600 from hiyouga/hiyouga/refactor_mllm_param
...
[model] refactor mllm param logic
Former-commit-id: 382e932228
2025-01-10 23:53:37 +08:00
hiyouga
c89d17ab63
refactor mllm param logic
...
Former-commit-id: f6f630a1c9
2025-01-10 15:45:48 +00:00
fzc8578
9213e48fa2
add minicpmv2.6
...
Former-commit-id: e45329e745
2025-01-10 23:45:44 +08:00
fzc8578
0fb50f9c88
add some
...
Former-commit-id: 771cc80294
2025-01-10 23:29:06 +08:00
fzc8578
bcbe37ff52
add some
...
Former-commit-id: ae1f528df3
2025-01-10 21:25:32 +08:00
fzc8578
994049380d
fix some
...
Former-commit-id: 15bbcdf8d3
2025-01-10 20:55:52 +08:00
fzc8578
cc6a6f698f
fix version
...
Former-commit-id: d09032049c
2025-01-10 20:31:04 +08:00
fzc8578
7138b43873
fix some
...
Former-commit-id: 2ee8ba2f39
2025-01-10 20:27:06 +08:00
fzc8578
aeb4f82ef2
tiny fix
...
Former-commit-id: 84026be06e
2025-01-10 20:15:39 +08:00
Zhangchi Feng
f51ac40f0a
Merge branch 'main' into minicpmv
...
Former-commit-id: fc045d7dd8
2025-01-10 20:12:07 +08:00
fzc8578
165fe8e219
add some
...
Former-commit-id: 096a6cb67a
2025-01-10 20:01:22 +08:00
hoshi-hiyouga
4243c618f0
Merge pull request #6597 from hiyouga/hiyouga/upd_wechat
...
[assets] update wechat
Former-commit-id: b308ddf097
2025-01-10 18:41:47 +08:00
hiyouga
368d22f79a
update wechat
...
Former-commit-id: 70ed03b288
2025-01-10 10:40:25 +00:00