hoshi-hiyouga
5f38bcaba9
[deps] upgrade vllm ( #6857 )
2025-02-08 15:02:28 +08:00
Zhangchi Feng
24c7842948
[model] support audio ( #6701 )
...
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
2025-02-05 04:59:09 +08:00
hoshi-hiyouga
e2dc5b952a
[misc] update license year & fix llama pro ( #6814 )
...
* fix llamapro script
* change year
2025-02-05 01:53:33 +08:00
Zhangchi Feng
ab9bd068ef
[data] fix minicpmv plugin ( #6801 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
* update readme
* support dpo of minicpmv
* update init audio
* update init audio
* [model]fix image process in minicpmo
2025-02-04 21:20:15 +08:00
hoshi-hiyouga
94803d8133
[model] add mistral small models ( #6786 )
2025-02-01 04:31:38 +08:00
hoshi-hiyouga
999c7c8fe0
[model] add qwen2.5 vl models ( #6779 )
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
15357cdad9
[breaking] support transformers 4.48 ( #6628 )
2025-01-31 01:36:33 +08:00
hoshi-hiyouga
1f47b6186c
[model] support yarn ( #6693 )
2025-01-18 13:56:09 +08:00
hoshi-hiyouga
7bf09abf1c
fix qwen2 moe ( #6684 )
2025-01-17 13:46:09 +08:00
hoshi-hiyouga
7a04021d04
[optim] clean apollo ( #6645 )
...
* clean apollo code
* update readme
2025-01-15 01:42:50 +08:00
zhuHQ
d9189f9f0b
[optim] add support to APOLLO ( #6617 )
2025-01-15 00:24:56 +08:00
hoshi-hiyouga
1278c3e92e
lint ( #6641 )
2025-01-14 18:40:07 +08:00
Haian Huang(深度眸)
deacc00b12
Support InternLM3 Dense 8B Model ( #6640 )
...
* support internlm3
* update
* update
* update
* add hint
2025-01-14 18:07:27 +08:00
Xiaosu Zhu
58d029f321
Fix tokenizer max length ( #6632 )
2025-01-14 17:35:54 +08:00
Zhangchi Feng
158a127d34
Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 ( #6631 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
2025-01-14 17:34:58 +08:00
Zhangchi Feng
c3fda5046d
Support new features of MiniCPM-V ( #6626 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
2025-01-14 00:26:19 +08:00
codingma
03de5ac912
add nf4 qlora support on Ascend NPU ( #6601 )
...
* add nf4 qlora support on Ascend NPU
* add transformers version check
* add python>=3.10 requirement description for npu
* tiny fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
2025-01-13 19:43:36 +08:00
fzc8578
291384dea8
adapt to new mllm_param
2025-01-11 00:16:34 +08:00
Zhangchi Feng
ed0895a9c1
Merge branch 'main' into minicpmv
2025-01-11 00:01:36 +08:00
hiyouga
f6f630a1c9
refactor mllm param logic
2025-01-10 15:45:48 +00:00
fzc8578
15bbcdf8d3
fix some
2025-01-10 20:55:52 +08:00
fzc8578
2ee8ba2f39
fix some
2025-01-10 20:27:06 +08:00
fzc8578
84026be06e
tiny fix
2025-01-10 20:15:39 +08:00
Zhangchi Feng
fc045d7dd8
Merge branch 'main' into minicpmv
2025-01-10 20:12:07 +08:00
fzc8578
096a6cb67a
add some
2025-01-10 20:01:22 +08:00
hiyouga
47e17dd689
imporve log
2025-01-08 09:56:10 +00:00
fzc8578
785cc70ff2
add some
2025-01-06 19:32:39 +08:00
fzc8578
79c2d7090c
add some
2025-01-04 11:11:15 +08:00
Yaser Afshar
0943776326
Add trust_remote_code parameter and remove True
...
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
to enhance security
2024-12-17 12:25:12 +00:00
hoshi-hiyouga
a665ad6178
Merge pull request #6364 from hiyouga/hiyouga/control_reenterent_gc
...
[model] support non-reenterent-gc
2024-12-17 19:58:36 +08:00
hiyouga
f319da6937
support non-reenterent-gc & fix #6358
2024-12-17 11:41:59 +00:00
hiyouga
2d107d3aef
generalized packing & fix #6343
2024-12-17 10:26:19 +00:00
hiyouga
99c62660c6
support qwen2vl train proj only
2024-12-05 10:37:42 +00:00
hoshi-hiyouga
75b586c31a
fix visual patch
2024-11-25 20:06:06 +08:00
hoshi-hiyouga
0516e556a7
fix #6136
2024-11-25 19:43:42 +08:00
hiyouga
df477370dc
add forbidden modules
2024-11-23 18:34:15 +00:00
hiyouga
446441fdb0
fix inputs
2024-11-23 18:26:02 +00:00
marko1616
b1e43e56db
Linter.
2024-11-23 16:09:04 +00:00
marko1616
8372c5e377
Tiny fix.
2024-11-23 16:09:01 +00:00
hiyouga
c38aa29336
support rank0 logger
2024-11-02 18:31:04 +08:00
hoshi-hiyouga
8408339d83
Merge pull request #5907 from hiyouga/hiyouga/dev
...
[data] fix template replace behavior
2024-11-02 13:42:53 +08:00
hiyouga
bfe1abd7af
fix #5904
2024-11-02 13:08:15 +08:00
hiyouga
93d3b8f43f
update tests
2024-11-02 12:41:44 +08:00
hiyouga
30567a1487
fix incorrect loss value for vlms
2024-10-30 08:56:46 +00:00
hoshi-hiyouga
0baa7735f6
Update visual.py
2024-10-29 22:10:29 +08:00
Kingsley
67f59579d7
Merge branch 'hiyouga:main' into pixtral-patch
2024-10-29 21:01:25 +08:00
hiyouga
21db8ed2f4
use pre-commit
2024-10-29 09:07:46 +00:00
hiyouga
77666bd227
update requires
2024-10-29 16:10:07 +08:00
hoshi-hiyouga
b4c7dd3ac5
fix #5797
2024-10-23 20:49:44 +08:00
hoshi-hiyouga
93b9067dfc
Update loader.py
2024-10-17 19:48:12 +08:00