hoshi-hiyouga
|
5f38bcaba9
|
[deps] upgrade vllm (#6857)
|
2025-02-08 15:02:28 +08:00 |
|
Zhangchi Feng
|
24c7842948
|
[model] support audio (#6701)
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
|
2025-02-05 04:59:09 +08:00 |
|
hoshi-hiyouga
|
e2dc5b952a
|
[misc] update license year & fix llama pro (#6814)
* fix llamapro script
* change year
|
2025-02-05 01:53:33 +08:00 |
|
hoshi-hiyouga
|
94803d8133
|
[model] add mistral small models (#6786)
|
2025-02-01 04:31:38 +08:00 |
|
hoshi-hiyouga
|
999c7c8fe0
|
[model] add qwen2.5 vl models (#6779)
|
2025-01-31 03:00:29 +08:00 |
|
hoshi-hiyouga
|
15357cdad9
|
[breaking] support transformers 4.48 (#6628)
|
2025-01-31 01:36:33 +08:00 |
|
hoshi-hiyouga
|
1f47b6186c
|
[model] support yarn (#6693)
|
2025-01-18 13:56:09 +08:00 |
|
hoshi-hiyouga
|
7bf09abf1c
|
fix qwen2 moe (#6684)
|
2025-01-17 13:46:09 +08:00 |
|
hoshi-hiyouga
|
7a04021d04
|
[optim] clean apollo (#6645)
* clean apollo code
* update readme
|
2025-01-15 01:42:50 +08:00 |
|
zhuHQ
|
d9189f9f0b
|
[optim] add support to APOLLO (#6617)
|
2025-01-15 00:24:56 +08:00 |
|
Zhangchi Feng
|
158a127d34
|
Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 (#6631)
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
|
2025-01-14 17:34:58 +08:00 |
|
Zhangchi Feng
|
c3fda5046d
|
Support new features of MiniCPM-V (#6626)
* fix template name
* tiny fix
* support minicpm-o-2.6
|
2025-01-14 00:26:19 +08:00 |
|
codingma
|
03de5ac912
|
add nf4 qlora support on Ascend NPU (#6601)
* add nf4 qlora support on Ascend NPU
* add transformers version check
* add python>=3.10 requirement description for npu
* tiny fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
|
2025-01-13 19:43:36 +08:00 |
|
fzc8578
|
291384dea8
|
adapt to new mllm_param
|
2025-01-11 00:16:34 +08:00 |
|
Zhangchi Feng
|
ed0895a9c1
|
Merge branch 'main' into minicpmv
|
2025-01-11 00:01:36 +08:00 |
|
hiyouga
|
f6f630a1c9
|
refactor mllm param logic
|
2025-01-10 15:45:48 +00:00 |
|
fzc8578
|
2ee8ba2f39
|
fix some
|
2025-01-10 20:27:06 +08:00 |
|
fzc8578
|
84026be06e
|
tiny fix
|
2025-01-10 20:15:39 +08:00 |
|
Zhangchi Feng
|
fc045d7dd8
|
Merge branch 'main' into minicpmv
|
2025-01-10 20:12:07 +08:00 |
|
fzc8578
|
096a6cb67a
|
add some
|
2025-01-10 20:01:22 +08:00 |
|
hiyouga
|
47e17dd689
|
imporve log
|
2025-01-08 09:56:10 +00:00 |
|
fzc8578
|
785cc70ff2
|
add some
|
2025-01-06 19:32:39 +08:00 |
|
fzc8578
|
79c2d7090c
|
add some
|
2025-01-04 11:11:15 +08:00 |
|
Yaser Afshar
|
0943776326
|
Add trust_remote_code parameter and remove True
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
to enhance security
|
2024-12-17 12:25:12 +00:00 |
|
hoshi-hiyouga
|
a665ad6178
|
Merge pull request #6364 from hiyouga/hiyouga/control_reenterent_gc
[model] support non-reenterent-gc
|
2024-12-17 19:58:36 +08:00 |
|
hiyouga
|
f319da6937
|
support non-reenterent-gc & fix #6358
|
2024-12-17 11:41:59 +00:00 |
|
hiyouga
|
2d107d3aef
|
generalized packing & fix #6343
|
2024-12-17 10:26:19 +00:00 |
|
hiyouga
|
99c62660c6
|
support qwen2vl train proj only
|
2024-12-05 10:37:42 +00:00 |
|
hoshi-hiyouga
|
75b586c31a
|
fix visual patch
|
2024-11-25 20:06:06 +08:00 |
|
hoshi-hiyouga
|
0516e556a7
|
fix #6136
|
2024-11-25 19:43:42 +08:00 |
|
hiyouga
|
df477370dc
|
add forbidden modules
|
2024-11-23 18:34:15 +00:00 |
|
hiyouga
|
446441fdb0
|
fix inputs
|
2024-11-23 18:26:02 +00:00 |
|
marko1616
|
b1e43e56db
|
Linter.
|
2024-11-23 16:09:04 +00:00 |
|
marko1616
|
8372c5e377
|
Tiny fix.
|
2024-11-23 16:09:01 +00:00 |
|
hiyouga
|
c38aa29336
|
support rank0 logger
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
93d3b8f43f
|
update tests
|
2024-11-02 12:41:44 +08:00 |
|
hiyouga
|
30567a1487
|
fix incorrect loss value for vlms
|
2024-10-30 08:56:46 +00:00 |
|
hoshi-hiyouga
|
0baa7735f6
|
Update visual.py
|
2024-10-29 22:10:29 +08:00 |
|
Kingsley
|
67f59579d7
|
Merge branch 'hiyouga:main' into pixtral-patch
|
2024-10-29 21:01:25 +08:00 |
|
hiyouga
|
21db8ed2f4
|
use pre-commit
|
2024-10-29 09:07:46 +00:00 |
|
hiyouga
|
77666bd227
|
update requires
|
2024-10-29 16:10:07 +08:00 |
|
Kingsley
|
93a441a6b7
|
Merge branch 'hiyouga:main' into pixtral-patch
|
2024-10-08 21:04:08 +08:00 |
|
hiyouga
|
451d271718
|
tiny fix
|
2024-10-08 17:48:56 +08:00 |
|
Kingsley
|
e53f47c0b3
|
Merge branch 'hiyouga:main' into pixtral-patch
|
2024-10-01 00:52:31 +08:00 |
|
hiyouga
|
fe7ffccdb9
|
fix #5542
|
2024-09-30 23:28:55 +08:00 |
|
Kingsley
|
9ddb84052e
|
sync with former
|
2024-09-30 20:27:05 +08:00 |
|
Kingsley
|
2166b9bc6b
|
fix some errors due to inconsistency of model cards
|
2024-09-30 19:58:34 +08:00 |
|
Zhangchi Feng
|
26f45829b4
|
Merge branch 'main' into pixtral-patch
|
2024-09-30 12:37:03 +08:00 |
|
BUAADreamer
|
485fc04716
|
fix constants
|
2024-09-29 22:00:01 +08:00 |
|
BUAADreamer
|
23916d57c1
|
fix style
|
2024-09-29 21:39:37 +08:00 |
|