hoshi-hiyouga
|
9b5baa97f0
|
[data] qwen3 fixes (#8109)
|
2025-05-20 02:00:30 +08:00 |
|
Saiya
|
ab41f7956c
|
[infer] support lora adapter for SGLang backend (#8067)
|
2025-05-16 23:33:47 +08:00 |
|
hoshi-hiyouga
|
ce7032e1b3
|
[model] add qwen2 omni 3b (#7945)
|
2025-05-03 16:36:51 +08:00 |
|
hoshi-hiyouga
|
13b05e74f1
|
[hparam] add enable think argument (#7928)
|
2025-04-30 17:21:30 +08:00 |
|
hoshi-hiyouga
|
052ca871bd
|
[data] optimize qwen3 loss computation (#7923)
|
2025-04-30 16:18:00 +08:00 |
|
hoshi-hiyouga
|
5e22597ff1
|
[infer] vllm video/audio inference (#7566)
|
2025-04-02 02:27:04 +08:00 |
|
Qiaolin Yu
|
a44a53ebec
|
[inference] support sglang backend (#7278)
* Mimic SGLang offline Engine
* Add more tests and args
* Pass all current tests
* Clean Code
* fix sample_params
* clean code
* Fix Stream Chat
* change sglang from engine mode to server mode
* fix
* Fix Review Issues
* Use SGLang Built-In Utilities
* Fix test SGLang
* Some Doc Issue
* fix sglang engine
* add readme
---------
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
|
2025-03-15 04:37:58 +08:00 |
|