Commit Graph

111 Commits

Author SHA1 Message Date
hoshi-hiyouga
5f38bcaba9 [deps] upgrade vllm (#6857) 2025-02-08 15:02:28 +08:00
Zhangchi Feng
24c7842948 [model] support audio (#6701)
* support qwen2_audio

* improve code

* lint

* fix

* fix

* fix

---------

Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-02-05 04:59:09 +08:00
hoshi-hiyouga
e2dc5b952a [misc] update license year & fix llama pro (#6814)
* fix llamapro script

* change year
2025-02-05 01:53:33 +08:00
hoshi-hiyouga
94803d8133 [model] add mistral small models (#6786) 2025-02-01 04:31:38 +08:00
hoshi-hiyouga
999c7c8fe0 [model] add qwen2.5 vl models (#6779) 2025-01-31 03:00:29 +08:00
hoshi-hiyouga
15357cdad9 [breaking] support transformers 4.48 (#6628) 2025-01-31 01:36:33 +08:00
hoshi-hiyouga
1f47b6186c [model] support yarn (#6693) 2025-01-18 13:56:09 +08:00
hoshi-hiyouga
7bf09abf1c fix qwen2 moe (#6684) 2025-01-17 13:46:09 +08:00
hoshi-hiyouga
7a04021d04 [optim] clean apollo (#6645)
* clean apollo code

* update readme
2025-01-15 01:42:50 +08:00
zhuHQ
d9189f9f0b [optim] add support to APOLLO (#6617) 2025-01-15 00:24:56 +08:00
Zhangchi Feng
158a127d34 Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 (#6631)
* fix template name

* tiny fix

* support minicpm-o-2.6

* support inference of minicpmv
2025-01-14 17:34:58 +08:00
Zhangchi Feng
c3fda5046d Support new features of MiniCPM-V (#6626)
* fix template name

* tiny fix

* support minicpm-o-2.6
2025-01-14 00:26:19 +08:00
codingma
03de5ac912 add nf4 qlora support on Ascend NPU (#6601)
* add nf4 qlora support on Ascend NPU

* add transformers version check

* add python>=3.10 requirement description for npu

* tiny fix

---------

Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-01-13 19:43:36 +08:00
fzc8578
291384dea8 adapt to new mllm_param 2025-01-11 00:16:34 +08:00
Zhangchi Feng
ed0895a9c1 Merge branch 'main' into minicpmv 2025-01-11 00:01:36 +08:00
hiyouga
f6f630a1c9 refactor mllm param logic 2025-01-10 15:45:48 +00:00
fzc8578
2ee8ba2f39 fix some 2025-01-10 20:27:06 +08:00
fzc8578
84026be06e tiny fix 2025-01-10 20:15:39 +08:00
Zhangchi Feng
fc045d7dd8 Merge branch 'main' into minicpmv 2025-01-10 20:12:07 +08:00
fzc8578
096a6cb67a add some 2025-01-10 20:01:22 +08:00
hiyouga
47e17dd689 imporve log 2025-01-08 09:56:10 +00:00
fzc8578
785cc70ff2 add some 2025-01-06 19:32:39 +08:00
fzc8578
79c2d7090c add some 2025-01-04 11:11:15 +08:00
Yaser Afshar
0943776326 Add trust_remote_code parameter and remove True
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
  to enhance security
2024-12-17 12:25:12 +00:00
hoshi-hiyouga
a665ad6178 Merge pull request #6364 from hiyouga/hiyouga/control_reenterent_gc
[model] support non-reenterent-gc
2024-12-17 19:58:36 +08:00
hiyouga
f319da6937 support non-reenterent-gc & fix #6358 2024-12-17 11:41:59 +00:00
hiyouga
2d107d3aef generalized packing & fix #6343 2024-12-17 10:26:19 +00:00
hiyouga
99c62660c6 support qwen2vl train proj only 2024-12-05 10:37:42 +00:00
hoshi-hiyouga
75b586c31a fix visual patch 2024-11-25 20:06:06 +08:00
hoshi-hiyouga
0516e556a7 fix #6136 2024-11-25 19:43:42 +08:00
hiyouga
df477370dc add forbidden modules 2024-11-23 18:34:15 +00:00
hiyouga
446441fdb0 fix inputs 2024-11-23 18:26:02 +00:00
marko1616
b1e43e56db Linter. 2024-11-23 16:09:04 +00:00
marko1616
8372c5e377 Tiny fix. 2024-11-23 16:09:01 +00:00
hiyouga
c38aa29336 support rank0 logger 2024-11-02 18:31:04 +08:00
hiyouga
93d3b8f43f update tests 2024-11-02 12:41:44 +08:00
hiyouga
30567a1487 fix incorrect loss value for vlms 2024-10-30 08:56:46 +00:00
hoshi-hiyouga
0baa7735f6 Update visual.py 2024-10-29 22:10:29 +08:00
Kingsley
67f59579d7 Merge branch 'hiyouga:main' into pixtral-patch 2024-10-29 21:01:25 +08:00
hiyouga
21db8ed2f4 use pre-commit 2024-10-29 09:07:46 +00:00
hiyouga
77666bd227 update requires 2024-10-29 16:10:07 +08:00
Kingsley
93a441a6b7 Merge branch 'hiyouga:main' into pixtral-patch 2024-10-08 21:04:08 +08:00
hiyouga
451d271718 tiny fix 2024-10-08 17:48:56 +08:00
Kingsley
e53f47c0b3 Merge branch 'hiyouga:main' into pixtral-patch 2024-10-01 00:52:31 +08:00
hiyouga
fe7ffccdb9 fix #5542 2024-09-30 23:28:55 +08:00
Kingsley
9ddb84052e sync with former 2024-09-30 20:27:05 +08:00
Kingsley
2166b9bc6b fix some errors due to inconsistency of model cards 2024-09-30 19:58:34 +08:00
Zhangchi Feng
26f45829b4 Merge branch 'main' into pixtral-patch 2024-09-30 12:37:03 +08:00
BUAADreamer
485fc04716 fix constants 2024-09-29 22:00:01 +08:00
BUAADreamer
23916d57c1 fix style 2024-09-29 21:39:37 +08:00