Xiaosu Zhu
bc9ada9db7
[misc] update liger-kernel's monkey patch ( #7453 )
...
* Update liger_kernel.py
* Update setup.py
2025-03-25 11:58:52 +08:00
AbdelKarim ELJANDOUBI
b6dc7e01e2
[misc] enable liger kernel for gemma3 text and paligemma ( #7466 )
...
* add gemma3 text
* add paligemma (1,2 and 2 mix)
2025-03-25 09:27:43 +08:00
Kenny Lam
59a56f7226
[misc] enable liger kernel for gemma3 ( #7462 )
2025-03-24 19:09:59 +08:00
hoshi-hiyouga
ef5f1c1def
[data] gemma3 plugin pan and scan ( #7294 )
...
* gemma3 pan and scan
* add test case
* fix test
2025-03-13 23:29:23 +08:00
hoshi-hiyouga
9ccfb97a2c
[misc] update format ( #7277 )
2025-03-13 02:53:08 +08:00
hoshi-hiyouga
165d3ed084
[model] support gemma3 ( #7273 )
2025-03-13 01:35:23 +08:00
hoshi-hiyouga
7c1640ed5f
[misc] upgrade format to py39 ( #7256 )
2025-03-12 00:08:41 +08:00
hoshi-hiyouga
5a29f49fb1
[config] update args ( #7231 )
...
Former-commit-id: ed8b12e3cb
2025-03-10 23:04:43 +08:00
hoshi-hiyouga
1f4a0b11ba
[data] update vlm args ( #6976 )
...
Former-commit-id: 3da2cc2710
2025-02-18 02:12:51 +08:00
hoshi-hiyouga
b1d31ff0f9
[data] add min resolution option ( #6975 )
...
Former-commit-id: 7faecc0301
2025-02-18 01:40:46 +08:00
hoshi-hiyouga
2baf8bf03d
[misc] fix lora regex ( #6944 )
...
* fix lora regex
* fix
Former-commit-id: 1ada3ae5a3
2025-02-14 21:38:43 +08:00
hoshi-hiyouga
13e1b7ee2b
[misc] fix grad ckpt ( #6931 )
...
Former-commit-id: c31c63b411
2025-02-13 23:27:51 +08:00
hoshi-hiyouga
cd493b91de
[model] add liger kernel to qwen2_5 vl ( #6930 )
...
* add liger kernel to qwen2_5 vl
* fix patch
* fix patch
Former-commit-id: 797043d29c
2025-02-13 23:05:54 +08:00
hoshi-hiyouga
036fb0d561
[misc] fix grad ckpt func ( #6916 )
...
Former-commit-id: e34c3c06da
2025-02-13 00:17:18 +08:00
hoshi-hiyouga
ff6658ad27
[deps] upgrade vllm ( #6857 )
...
Former-commit-id: 5f38bcaba9
2025-02-08 15:02:28 +08:00
Zhangchi Feng
01915eaf40
[model] support audio ( #6701 )
...
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 24c7842948
2025-02-05 04:59:09 +08:00
hoshi-hiyouga
1fee69f874
[misc] update license year & fix llama pro ( #6814 )
...
* fix llamapro script
* change year
Former-commit-id: e2dc5b952a
2025-02-05 01:53:33 +08:00
Zhangchi Feng
85f22d01bf
[data] fix minicpmv plugin ( #6801 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
* update init audio
* update init audio
* [model]fix image process in minicpmo
Former-commit-id: ab9bd068ef
2025-02-04 21:20:15 +08:00
hoshi-hiyouga
445d643ef3
[model] add mistral small models ( #6786 )
...
Former-commit-id: 94803d8133
2025-02-01 04:31:38 +08:00
hoshi-hiyouga
e8c1979b79
[model] add qwen2.5 vl models ( #6779 )
...
Former-commit-id: 999c7c8fe0
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
f6779b0e0c
[breaking] support transformers 4.48 ( #6628 )
...
Former-commit-id: 15357cdad9
2025-01-31 01:36:33 +08:00
hoshi-hiyouga
1efe525df7
[model] support yarn ( #6693 )
...
Former-commit-id: 1f47b6186c
2025-01-18 13:56:09 +08:00
hoshi-hiyouga
788accb601
fix qwen2 moe ( #6684 )
...
Former-commit-id: 7bf09abf1c
2025-01-17 13:46:09 +08:00
hoshi-hiyouga
9ef85f8fc4
[optim] clean apollo ( #6645 )
...
* clean apollo code
* update readme
Former-commit-id: 7a04021d04
2025-01-15 01:42:50 +08:00
zhuHQ
763f9b9df0
[optim] add support to APOLLO ( #6617 )
...
Former-commit-id: d9189f9f0b
2025-01-15 00:24:56 +08:00
hoshi-hiyouga
91433d639c
lint ( #6641 )
...
Former-commit-id: 1278c3e92e
2025-01-14 18:40:07 +08:00
Haian Huang(深度眸)
864ee06243
Support InternLM3 Dense 8B Model ( #6640 )
...
* support internlm3
* update
* update
* update
* add hint
Former-commit-id: deacc00b12
2025-01-14 18:07:27 +08:00
Xiaosu Zhu
a52496cc09
Fix tokenizer max length ( #6632 )
...
Former-commit-id: 58d029f321
2025-01-14 17:35:54 +08:00
Zhangchi Feng
ad119afc58
Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 ( #6631 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
Former-commit-id: 158a127d34
2025-01-14 17:34:58 +08:00
Zhangchi Feng
201a495154
Support new features of MiniCPM-V ( #6626 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
Former-commit-id: c3fda5046d
2025-01-14 00:26:19 +08:00
codingma
089c7d5e51
add nf4 qlora support on Ascend NPU ( #6601 )
...
* add nf4 qlora support on Ascend NPU
* add transformers version check
* add python>=3.10 requirement description for npu
* tiny fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 03de5ac912
2025-01-13 19:43:36 +08:00
fzc8578
08e8499a98
adapt to new mllm_param
...
Former-commit-id: 291384dea8
2025-01-11 00:16:34 +08:00
Zhangchi Feng
d5b18ee4a6
Merge branch 'main' into minicpmv
...
Former-commit-id: ed0895a9c1
2025-01-11 00:01:36 +08:00
hiyouga
c89d17ab63
refactor mllm param logic
...
Former-commit-id: f6f630a1c9
2025-01-10 15:45:48 +00:00
fzc8578
994049380d
fix some
...
Former-commit-id: 15bbcdf8d3
2025-01-10 20:55:52 +08:00
fzc8578
7138b43873
fix some
...
Former-commit-id: 2ee8ba2f39
2025-01-10 20:27:06 +08:00
fzc8578
aeb4f82ef2
tiny fix
...
Former-commit-id: 84026be06e
2025-01-10 20:15:39 +08:00
Zhangchi Feng
f51ac40f0a
Merge branch 'main' into minicpmv
...
Former-commit-id: fc045d7dd8
2025-01-10 20:12:07 +08:00
fzc8578
165fe8e219
add some
...
Former-commit-id: 096a6cb67a
2025-01-10 20:01:22 +08:00
hiyouga
da542fad18
imporve log
...
Former-commit-id: 47e17dd689
2025-01-08 09:56:10 +00:00
fzc8578
b9eeaa9706
add some
...
Former-commit-id: 785cc70ff2
2025-01-06 19:32:39 +08:00
fzc8578
b5ef5059ee
add some
...
Former-commit-id: 79c2d7090c
2025-01-04 11:11:15 +08:00
Yaser Afshar
fe4546a7bb
Add trust_remote_code parameter and remove True
...
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
to enhance security
Former-commit-id: 0943776326
2024-12-17 12:25:12 +00:00
hoshi-hiyouga
fc18db6290
Merge pull request #6364 from hiyouga/hiyouga/control_reenterent_gc
...
[model] support non-reenterent-gc
Former-commit-id: a665ad6178
2024-12-17 19:58:36 +08:00
hiyouga
64bac4bc7e
support non-reenterent-gc & fix #6358
...
Former-commit-id: f319da6937
2024-12-17 11:41:59 +00:00
hiyouga
bff1b94583
generalized packing & fix #6343
...
Former-commit-id: 2d107d3aef
2024-12-17 10:26:19 +00:00
hiyouga
cf8cad8e7e
support qwen2vl train proj only
...
Former-commit-id: 99c62660c6
2024-12-05 10:37:42 +00:00
hoshi-hiyouga
a489f10986
fix visual patch
...
Former-commit-id: 75b586c31a
2024-11-25 20:06:06 +08:00
hoshi-hiyouga
f48d07cd6c
fix #6136
...
Former-commit-id: 0516e556a7
2024-11-25 19:43:42 +08:00
hiyouga
253752ccca
add forbidden modules
...
Former-commit-id: df477370dc
2024-11-23 18:34:15 +00:00