90 Commits

Author SHA1 Message Date
Saiya
820ed764c4 [infer] support lora adapter for SGLang backend (#8067) 2025-05-16 23:33:47 +08:00
hoshi-hiyouga
937447bd8a [misc] fix qwen2 omni (#7962) 2025-05-06 15:39:13 +02:00
hoshi-hiyouga
52f25651a2 [model] add qwen2 omni 3b (#7945) 2025-05-03 16:36:51 +08:00
hoshi-hiyouga
6a584b4092 [hparam] add enable think argument (#7928) 2025-04-30 17:21:30 +08:00
hoshi-hiyouga
d8295cd601 [data] optimize qwen3 loss computation (#7923) 2025-04-30 16:18:00 +08:00
hoshi-hiyouga
0a0cfeb782 [breaking] bump transformers to 4.45.0 & improve ci (#7746)
* update ci

* fix

* fix

* fix

* fix

* fix
2025-04-17 02:36:48 +08:00
hoshi-hiyouga
903db09822 [infer] vllm video/audio inference (#7566) 2025-04-02 02:27:04 +08:00
Qiaolin Yu
30038d9ce7 [inference] support sglang backend (#7278)
* Mimic SGLang offline Engine

* Add more tests and args

* Pass all current tests

* Clean Code

* fix sample_params

* clean code

* Fix Stream Chat

* change sglang from engine mode to server mode

* fix

* Fix Review Issues

* Use SGLang Built-In Utilities

* Fix test SGLang

* Some Doc Issue

* fix sglang engine

* add readme

---------

Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-03-15 04:37:58 +08:00
hoshi-hiyouga
9ccfb97a2c [misc] update format (#7277) 2025-03-13 02:53:08 +08:00
hoshi-hiyouga
7c1640ed5f [misc] upgrade format to py39 (#7256) 2025-03-12 00:08:41 +08:00
hoshi-hiyouga
317d0855d2 [infer] fix vllm args (#7235)
Former-commit-id: ef7af457fc
2025-03-11 01:15:35 +08:00
hoshi-hiyouga
5a29f49fb1 [config] update args (#7231)
Former-commit-id: ed8b12e3cb
2025-03-10 23:04:43 +08:00
hoshi-hiyouga
eba31ae313 [webui] support escape html (#7190)
Former-commit-id: abb23f7673
2025-03-06 16:52:21 +08:00
hoshi-hiyouga
ee1b580328 [inference] fix hf_engine (#7120)
Former-commit-id: 1036311826
2025-03-01 05:22:49 +08:00
hoshi-hiyouga
184c5d0882 [misc] fix script (#6977)
Former-commit-id: cc8c7e762b
2025-02-18 17:00:46 +08:00
Zhangchi Feng
01915eaf40 [model] support audio (#6701)
* support qwen2_audio

* improve code

* lint

* fix

* fix

* fix

---------

Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 24c7842948
2025-02-05 04:59:09 +08:00
hoshi-hiyouga
1fee69f874 [misc] update license year & fix llama pro (#6814)
* fix llamapro script

* change year

Former-commit-id: e2dc5b952a
2025-02-05 01:53:33 +08:00
hoshi-hiyouga
445d643ef3 [model] add mistral small models (#6786)
Former-commit-id: 94803d8133
2025-02-01 04:31:38 +08:00
hoshi-hiyouga
e8c1979b79 [model] add qwen2.5 vl models (#6779)
Former-commit-id: 999c7c8fe0
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
91433d639c lint (#6641)
Former-commit-id: 1278c3e92e
2025-01-14 18:40:07 +08:00
Zhangchi Feng
ad119afc58 Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 (#6631)
* fix template name

* tiny fix

* support minicpm-o-2.6

* support inference of minicpmv

Former-commit-id: 158a127d34
2025-01-14 17:34:58 +08:00
hoshi-hiyouga
d8cba9464f [inference] fix stop token for object detection (#6624)
* fix stop token

* update minicpm data pipeline

* fix npu qlora examples

Former-commit-id: e3e2c8c689
2025-01-13 21:34:20 +08:00
hiyouga
da542fad18 improve log
Former-commit-id: 47e17dd689
2025-01-08 09:56:10 +00:00
hiyouga
813f5919a3 fix #6482
Former-commit-id: 6f5bb3b8e5
2024-12-30 06:03:07 +00:00
hiyouga
47c2d91933 support report custom args
Former-commit-id: 5111cac6f8
2024-12-21 21:42:45 +00:00
hiyouga
f07bad7144 fix paligemma infer
Former-commit-id: 84cd1188ac
2024-12-21 20:24:32 +00:00
Yaser Afshar
fe4546a7bb Add trust_remote_code parameter and remove True
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
  to enhance security

Former-commit-id: 0943776326
2024-12-17 12:25:12 +00:00
hiyouga
a94a1eac67 support control eos, fix #6345
Former-commit-id: eda76de32b
2024-12-17 10:42:05 +00:00
hiyouga
88b06a0c7f support qwen2vl vllm infer
Former-commit-id: 207f8b069c
2024-12-05 10:17:26 +00:00
hiyouga
65699c29d4 fix vllm
Former-commit-id: 13ee1f5cec
2024-11-25 00:07:24 +08:00
hiyouga
dcc67ac1a5 fix qwen2vl vllm infer
Former-commit-id: fa50fc470e
2024-11-24 23:27:24 +08:00
hiyouga
e99031daa4 fix inputs
Former-commit-id: 446441fdb0
2024-11-23 18:26:02 +00:00
hoshi-hiyouga
9c394f11ef fix #5988
Former-commit-id: 8d70edf39b
2024-11-11 13:57:14 +08:00
hiyouga
0d18cca0db add vllm config
Former-commit-id: 58ab4579dc
2024-11-10 21:28:18 +08:00
hiyouga
97f4451912 fix #5966
Former-commit-id: 8f3a32286e
2024-11-08 23:49:16 +08:00
hiyouga
2360d63ebc fix chat engines
Former-commit-id: 8c88065c38
2024-11-04 08:18:12 +00:00
hiyouga
e83cb17f97 support rank0 logger
Former-commit-id: c38aa29336
2024-11-02 18:31:04 +08:00
hiyouga
2eba98e152 add examples
Former-commit-id: e824b715ad
2024-11-01 08:41:54 +00:00
hiyouga
8ecc12ee2a support multiimage inference
Former-commit-id: e80a481927
2024-11-01 07:25:20 +00:00
hoshi-hiyouga
15786539d7 fix bug
Former-commit-id: bb0afda8fb
2024-10-29 22:19:04 +08:00
hoshi-hiyouga
90cd3538de Update hf_engine.py
Former-commit-id: 6e212fdab5
2024-10-29 22:00:59 +08:00
Kingsley
3053a806e9 Merge branch 'hiyouga:main' into pixtral-patch
Former-commit-id: 67f59579d7
2024-10-29 21:01:25 +08:00
hiyouga
0d8aa6e6ef use pre-commit
Former-commit-id: 21db8ed2f4
2024-10-29 09:07:46 +00:00
KUANGDD
62cbcb646a modify style & little change
Former-commit-id: 9d6143e36a
2024-10-23 15:24:07 +08:00
Kingsley
94ce8f561f fix some errors due to inconsistency of model cards
Former-commit-id: 2166b9bc6b
2024-09-30 19:58:34 +08:00
Kingsley
66e473d519 remove some unnecessary if conditions
Former-commit-id: de06e2678e
2024-09-28 02:14:06 +08:00
hiyouga
009500bc6d fix #5411
Former-commit-id: c7e51ff187
2024-09-11 17:36:42 +08:00
hiyouga
dc64166d13 update scripts
Former-commit-id: f2aa02c070
2024-09-08 14:17:41 +08:00
hiyouga
78cf256067 support vllm 0.6.0
Former-commit-id: b6681d7198
2024-09-08 02:26:20 +08:00
hiyouga
7ccb86b215 add docstrings, refactor logger
Former-commit-id: 54c6905937
2024-09-08 00:56:56 +08:00