curnane-lab
|
2092abc217
|
[npu] add Qwen3.5 support with Partial RoPE and Hybrid Attention (#10421)
Co-authored-by: Curnane <mingliangfu@users.noreply.github.com>
|
2026-04-27 23:36:07 +08:00 |
|
xvxuopop
|
e8deda53a1
|
[example] add Qwen3 series examples (#9624)
Co-authored-by: UsernameFull <tohowtodoit@gmail.com>
|
2025-12-18 21:27:00 +08:00 |
|
Yaowei Zheng
|
5d56817e2b
|
[misc] lint (#9593)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2025-12-09 18:00:35 +08:00 |
|
xvxuopop
|
955396e8a5
|
[example] correct the parameter errors in the examples file. (#9543)
|
2025-11-27 17:38:38 +08:00 |
|
xvxuopop
|
2c4fb3c97e
|
[v1] Support fused moe kernel for qwen3vlmoe model. (#9532)
|
2025-11-27 02:13:33 +08:00 |
|