Saiya
820ed764c4
[infer] support lora adapter for SGLang backend ( #8067 )
2025-05-16 23:33:47 +08:00
hoshi-hiyouga
937447bd8a
[misc] fix qwen2 omni ( #7962 )
2025-05-06 15:39:13 +02:00
hoshi-hiyouga
52f25651a2
[model] add qwen2 omni 3b ( #7945 )
2025-05-03 16:36:51 +08:00
hoshi-hiyouga
6a584b4092
[hparam] add enable think argument ( #7928 )
2025-04-30 17:21:30 +08:00
hoshi-hiyouga
d8295cd601
[data] optimize qwen3 loss computation ( #7923 )
2025-04-30 16:18:00 +08:00
hoshi-hiyouga
0a0cfeb782
[breaking] bump transformers to 4.45.0 & improve ci ( #7746 )
...
* update ci
* fix
* fix
* fix
* fix
* fix
2025-04-17 02:36:48 +08:00
hoshi-hiyouga
903db09822
[infer] vllm video/audio inference ( #7566 )
2025-04-02 02:27:04 +08:00
Qiaolin Yu
30038d9ce7
[inference] support sglang backend ( #7278 )
...
* Mimic SGLang offline Engine
* Add more tests and args
* Pass all current tests
* Clean Code
* fix sample_params
* clean code
* Fix Stream Chat
* change sglang from engine mode to server mode
* fix
* Fix Review Issues
* Use SGLang Built-In Utilities
* Fix test SGLang
* Some Doc Issue
* fix sglang engine
* add readme
---------
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-03-15 04:37:58 +08:00
hoshi-hiyouga
9ccfb97a2c
[misc] update format ( #7277 )
2025-03-13 02:53:08 +08:00
hoshi-hiyouga
7c1640ed5f
[misc] upgrade format to py39 ( #7256 )
2025-03-12 00:08:41 +08:00
hoshi-hiyouga
317d0855d2
[infer] fix vllm args ( #7235 )
...
Former-commit-id: ef7af457fc44b1e8cad0c78717848617f98364f0
2025-03-11 01:15:35 +08:00
hoshi-hiyouga
5a29f49fb1
[config] update args ( #7231 )
...
Former-commit-id: ed8b12e3cbdaa85f5bde619081b86717a1f3c5fa
2025-03-10 23:04:43 +08:00
hoshi-hiyouga
eba31ae313
[webui] support escape html ( #7190 )
...
Former-commit-id: abb23f767351098a926202ea4edc94d9e9a4681c
2025-03-06 16:52:21 +08:00
hoshi-hiyouga
ee1b580328
[inference] fix hf_engine ( #7120 )
...
Former-commit-id: 1036311826a61fed2346a261c8a060c355778318
2025-03-01 05:22:49 +08:00
hoshi-hiyouga
184c5d0882
[misc] fix script ( #6977 )
...
Former-commit-id: cc8c7e762b9c873ef79529152465bbed9231053c
2025-02-18 17:00:46 +08:00
Zhangchi Feng
01915eaf40
[model] support audio ( #6701 )
...
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 24c78429489809873a1269a735ea5421340b32a2
2025-02-05 04:59:09 +08:00
hoshi-hiyouga
1fee69f874
[misc] update license year & fix llama pro ( #6814 )
...
* fix llamapro script
* change year
Former-commit-id: e2dc5b952aa22835d5220ba624f44676138b65ac
2025-02-05 01:53:33 +08:00
hoshi-hiyouga
445d643ef3
[model] add mistral small models ( #6786 )
...
Former-commit-id: 94803d8133fbbadff6d224cb6695feb5434fd4fd
2025-02-01 04:31:38 +08:00
hoshi-hiyouga
e8c1979b79
[model] add qwen2.5 vl models ( #6779 )
...
Former-commit-id: 999c7c8fe0caf6b837a1bdc2c6a24fafec327cd8
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
91433d639c
lint ( #6641 )
...
Former-commit-id: 1278c3e92eeb297e883aab89e2384c1df1d0e910
2025-01-14 18:40:07 +08:00
Zhangchi Feng
ad119afc58
Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 ( #6631 )
...
* fix template name
* tiny fix
* support minicpm-o-2.6
* support inference of minicpmv
Former-commit-id: 158a127d340d5e4ca23263ffad042f861fd77deb
2025-01-14 17:34:58 +08:00
hoshi-hiyouga
d8cba9464f
[inference] fix stop token for object detection ( #6624 )
...
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689c54ebb2af264de808502e5a8ba0f2b
2025-01-13 21:34:20 +08:00
hiyouga
da542fad18
imporve log
...
Former-commit-id: 47e17dd689840ca9b3c5f34448e5f80265336cca
2025-01-08 09:56:10 +00:00
hiyouga
813f5919a3
fix #6482
...
Former-commit-id: 6f5bb3b8e5b6eb7fdfd7b0ca8eba789ab741a7b6
2024-12-30 06:03:07 +00:00
hiyouga
47c2d91933
support report custom args
...
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
2024-12-21 21:42:45 +00:00
hiyouga
f07bad7144
fix paligemma infer
...
Former-commit-id: 84cd1188ac03c165e1a626db297936c2458627d6
2024-12-21 20:24:32 +00:00
Yaser Afshar
fe4546a7bb
Add trust_remote_code parameter and remove True
...
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
to enhance security
Former-commit-id: 09437763267bc7081159a6878cee9652a2b1ddac
2024-12-17 12:25:12 +00:00
hiyouga
a94a1eac67
support control eos, fix #6345
...
Former-commit-id: eda76de32bab103c650f246327d214539ae6f291
2024-12-17 10:42:05 +00:00
hiyouga
88b06a0c7f
support qwen2vl vllm infer
...
Former-commit-id: 207f8b069ca35a28de4588b4962e7254f451c52c
2024-12-05 10:17:26 +00:00
hiyouga
65699c29d4
fix vllm
...
Former-commit-id: 13ee1f5cec815590c5d290f0aca264e6d16ddd5d
2024-11-25 00:07:24 +08:00
hiyouga
dcc67ac1a5
fix qwen2vl vllm infer
...
Former-commit-id: fa50fc470e46010214f0e509ef75a6c167caac7f
2024-11-24 23:27:24 +08:00
hiyouga
e99031daa4
fix inputs
...
Former-commit-id: 446441fdb020b5a102480251cb8536dd8b3f8f99
2024-11-23 18:26:02 +00:00
hoshi-hiyouga
9c394f11ef
fix #5988
...
Former-commit-id: 8d70edf39bce7cb7be272f0bd2820306adf093ad
2024-11-11 13:57:14 +08:00
hiyouga
0d18cca0db
add vllm config
...
Former-commit-id: 58ab4579dc81a1dcea2bf5938ba3f3116cecfc76
2024-11-10 21:28:18 +08:00
hiyouga
97f4451912
fix #5966
...
Former-commit-id: 8f3a32286ebcfb3234e3981db2292dd165b1568d
2024-11-08 23:49:16 +08:00
hiyouga
2360d63ebc
fix chat engines
...
Former-commit-id: 8c88065c38d0e7dd60e06e27c7f921f7931471ee
2024-11-04 08:18:12 +00:00
hiyouga
e83cb17f97
support rank0 logger
...
Former-commit-id: c38aa29336f286266553da4909a7267d7ef21f37
2024-11-02 18:31:04 +08:00
hiyouga
2eba98e152
add examples
...
Former-commit-id: e824b715ad4bf885241b245b12d75563adab2e26
2024-11-01 08:41:54 +00:00
hiyouga
8ecc12ee2a
support multiimage inference
...
Former-commit-id: e80a4819274d46ac9e85db7469dc59d7c4e323c7
2024-11-01 07:25:20 +00:00
hoshi-hiyouga
15786539d7
fix bug
...
Former-commit-id: bb0afda8fbb7d3556c1742bccf0b33ee3bf0f0b6
2024-10-29 22:19:04 +08:00
hoshi-hiyouga
90cd3538de
Update hf_engine.py
...
Former-commit-id: 6e212fdab5f48c955db250ecfc197b89f8856e4b
2024-10-29 22:00:59 +08:00
Kingsley
3053a806e9
Merge branch 'hiyouga:main' into pixtral-patch
...
Former-commit-id: 67f59579d79e97689a4b3cba7101a423c30dab2b
2024-10-29 21:01:25 +08:00
hiyouga
0d8aa6e6ef
use pre-commit
...
Former-commit-id: 21db8ed2f4a0eba203754a92ce0741538e8ee709
2024-10-29 09:07:46 +00:00
KUANGDD
62cbcb646a
modify style & little change
...
Former-commit-id: 9d6143e36a12e0f295139d057aeb1843535435cf
2024-10-23 15:24:07 +08:00
Kingsley
94ce8f561f
fix some errors due to inconsistency of model cards
...
Former-commit-id: 2166b9bc6ba35760ff85b63620af9fa0213a4c78
2024-09-30 19:58:34 +08:00
Kingsley
66e473d519
remove some unnecessary if conditions
...
Former-commit-id: de06e2678e2168586614242f65939c5772e78774
2024-09-28 02:14:06 +08:00
hiyouga
009500bc6d
fix #5411
...
Former-commit-id: c7e51ff187658eb472c2b234f75d8934c6f7c782
2024-09-11 17:36:42 +08:00
hiyouga
dc64166d13
update scripts
...
Former-commit-id: f2aa02c070b1442bfb58a075f22af25386192bfa
2024-09-08 14:17:41 +08:00
hiyouga
78cf256067
support vllm 0.6.0
...
Former-commit-id: b6681d7198acf4acbebfe271dd22095e236bc430
2024-09-08 02:26:20 +08:00
hiyouga
7ccb86b215
add docstrings, refactor logger
...
Former-commit-id: 54c69059379d77dc9046c144cbe2d0253de3a4da
2024-09-08 00:56:56 +08:00