Saiya
820ed764c4
[infer] support lora adapter for SGLang backend ( #8067 )
2025-05-16 23:33:47 +08:00
hoshi-hiyouga
937447bd8a
[misc] fix qwen2 omni ( #7962 )
2025-05-06 15:39:13 +02:00
hoshi-hiyouga
52f25651a2
[model] add qwen2 omni 3b ( #7945 )
2025-05-03 16:36:51 +08:00
hoshi-hiyouga
6a584b4092
[hparam] add enable think argument ( #7928 )
2025-04-30 17:21:30 +08:00
hoshi-hiyouga
d8295cd601
[data] optimize qwen3 loss computation ( #7923 )
2025-04-30 16:18:00 +08:00
hoshi-hiyouga
0a0cfeb782
[breaking] bump transformers to 4.45.0 & improve ci ( #7746 )
...
* update ci
* fix
* fix
* fix
* fix
* fix
2025-04-17 02:36:48 +08:00
hoshi-hiyouga
903db09822
[infer] vllm video/audio inference ( #7566 )
2025-04-02 02:27:04 +08:00
Qiaolin Yu
30038d9ce7
[inference] support sglang backend ( #7278 )
...
* Mimic SGLang offline Engine
* Add more tests and args
* Pass all current tests
* Clean Code
* fix sample_params
* clean code
* Fix Stream Chat
* change sglang from engine mode to server mode
* fix
* Fix Review Issues
* Use SGLang Built-In Utilities
* Fix test SGLang
* Some Doc Issue
* fix sglang engine
* add readme
---------
Co-authored-by: Jin Pan <jpan236@wisc.edu >
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
2025-03-15 04:37:58 +08:00
hoshi-hiyouga
9ccfb97a2c
[misc] update format ( #7277 )
2025-03-13 02:53:08 +08:00
hoshi-hiyouga
7c1640ed5f
[misc] upgrade format to py39 ( #7256 )
2025-03-12 00:08:41 +08:00
hoshi-hiyouga
317d0855d2
[infer] fix vllm args ( #7235 )
...
Former-commit-id: ef7af457fc
2025-03-11 01:15:35 +08:00
hoshi-hiyouga
5a29f49fb1
[config] update args ( #7231 )
...
Former-commit-id: ed8b12e3cb
2025-03-10 23:04:43 +08:00
hoshi-hiyouga
eba31ae313
[webui] support escape html ( #7190 )
...
Former-commit-id: abb23f7673
2025-03-06 16:52:21 +08:00
hoshi-hiyouga
ee1b580328
[inference] fix hf_engine ( #7120 )
...
Former-commit-id: 1036311826
2025-03-01 05:22:49 +08:00
hoshi-hiyouga
184c5d0882
[misc] fix script ( #6977 )
...
Former-commit-id: cc8c7e762b
2025-02-18 17:00:46 +08:00
Zhangchi Feng
01915eaf40
[model] support audio ( #6701 )
...
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 24c7842948
2025-02-05 04:59:09 +08:00
hoshi-hiyouga
1fee69f874
[misc] update license year & fix llama pro ( #6814 )
...
* fix llamapro script
* change year
Former-commit-id: e2dc5b952a
2025-02-05 01:53:33 +08:00
hoshi-hiyouga
445d643ef3
[model] add mistral small models ( #6786 )
...
Former-commit-id: 94803d8133
2025-02-01 04:31:38 +08:00
hoshi-hiyouga
e8c1979b79
[model] add qwen2.5 vl models ( #6779 )
...
Former-commit-id: 999c7c8fe0
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
91433d639c
lint ( #6641 )
...
Former-commit-id: 1278c3e92e
2025-01-14 18:40:07 +08:00
Zhangchi Feng
ad119afc58
Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 ( #6631 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
Former-commit-id: 158a127d34
2025-01-14 17:34:58 +08:00
hoshi-hiyouga
d8cba9464f
[inference] fix stop token for object detection ( #6624 )
...
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689
2025-01-13 21:34:20 +08:00
hiyouga
da542fad18
imporve log
...
Former-commit-id: 47e17dd689
2025-01-08 09:56:10 +00:00
hiyouga
813f5919a3
fix #6482
...
Former-commit-id: 6f5bb3b8e5
2024-12-30 06:03:07 +00:00
hiyouga
47c2d91933
support report custom args
...
Former-commit-id: 5111cac6f8
2024-12-21 21:42:45 +00:00
hiyouga
f07bad7144
fix paligemma infer
...
Former-commit-id: 84cd1188ac
2024-12-21 20:24:32 +00:00
Yaser Afshar
fe4546a7bb
Add trust_remote_code parameter and remove True
...
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
to enhance security
Former-commit-id: 0943776326
2024-12-17 12:25:12 +00:00
hiyouga
a94a1eac67
support control eos, fix #6345
...
Former-commit-id: eda76de32b
2024-12-17 10:42:05 +00:00
hiyouga
88b06a0c7f
support qwen2vl vllm infer
...
Former-commit-id: 207f8b069c
2024-12-05 10:17:26 +00:00
hiyouga
65699c29d4
fix vllm
...
Former-commit-id: 13ee1f5cec
2024-11-25 00:07:24 +08:00
hiyouga
dcc67ac1a5
fix qwen2vl vllm infer
...
Former-commit-id: fa50fc470e
2024-11-24 23:27:24 +08:00
hiyouga
e99031daa4
fix inputs
...
Former-commit-id: 446441fdb0
2024-11-23 18:26:02 +00:00
hoshi-hiyouga
9c394f11ef
fix #5988
...
Former-commit-id: 8d70edf39b
2024-11-11 13:57:14 +08:00
hiyouga
0d18cca0db
add vllm config
...
Former-commit-id: 58ab4579dc
2024-11-10 21:28:18 +08:00
hiyouga
97f4451912
fix #5966
...
Former-commit-id: 8f3a32286e
2024-11-08 23:49:16 +08:00
hiyouga
2360d63ebc
fix chat engines
...
Former-commit-id: 8c88065c38
2024-11-04 08:18:12 +00:00
hiyouga
e83cb17f97
support rank0 logger
...
Former-commit-id: c38aa29336
2024-11-02 18:31:04 +08:00
hiyouga
2eba98e152
add examples
...
Former-commit-id: e824b715ad
2024-11-01 08:41:54 +00:00
hiyouga
8ecc12ee2a
support multiimage inference
...
Former-commit-id: e80a481927
2024-11-01 07:25:20 +00:00
hoshi-hiyouga
15786539d7
fix bug
...
Former-commit-id: bb0afda8fb
2024-10-29 22:19:04 +08:00
hoshi-hiyouga
90cd3538de
Update hf_engine.py
...
Former-commit-id: 6e212fdab5
2024-10-29 22:00:59 +08:00
Kingsley
3053a806e9
Merge branch 'hiyouga:main' into pixtral-patch
...
Former-commit-id: 67f59579d7
2024-10-29 21:01:25 +08:00
hiyouga
0d8aa6e6ef
use pre-commit
...
Former-commit-id: 21db8ed2f4
2024-10-29 09:07:46 +00:00
KUANGDD
62cbcb646a
modify style & little change
...
Former-commit-id: 9d6143e36a
2024-10-23 15:24:07 +08:00
Kingsley
94ce8f561f
fix some errors due to inconsistency of model cards
...
Former-commit-id: 2166b9bc6b
2024-09-30 19:58:34 +08:00
Kingsley
66e473d519
remove some unnecessary if conditions
...
Former-commit-id: de06e2678e
2024-09-28 02:14:06 +08:00
hiyouga
009500bc6d
fix #5411
...
Former-commit-id: c7e51ff187
2024-09-11 17:36:42 +08:00
hiyouga
dc64166d13
update scripts
...
Former-commit-id: f2aa02c070
2024-09-08 14:17:41 +08:00
hiyouga
78cf256067
support vllm 0.6.0
...
Former-commit-id: b6681d7198
2024-09-08 02:26:20 +08:00
hiyouga
7ccb86b215
add docstrings, refactor logger
...
Former-commit-id: 54c6905937
2024-09-08 00:56:56 +08:00