Commit Graph

2858 Commits

Author SHA1 Message Date
Yaowei Zheng
5d56817e2b [misc] lint (#9593)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-12-09 18:00:35 +08:00
Yaowei Zheng
1bbb461f76 [assets] update readme (#9587) 2025-12-09 12:22:54 +08:00
Hertz
c1f5f8fff6 [model] support GLM4.6v (#9586) 2025-12-09 11:06:42 +08:00
Yaowei Zheng
5744f1ea94 [v1] add models & accelerator (#9579) 2025-12-08 02:30:25 +08:00
tangefly
739954910a [deps] Update for Transformers v5 (#9569) 2025-12-08 01:13:32 +08:00
xvxuopop
109162dc56 [fix] fix the issue when using fsdp2 with gradient checkpointing. (#9541)
Co-authored-by: jin-yongxu <jinyongxu@h-partners.com>
2025-12-06 16:04:51 +08:00
jiaqiw09
165f3f073a [examples] add fsdp config for mutiple nodes (#9575)
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
2025-12-05 23:22:48 +08:00
jiaqiw09
efb13b7483 [V1] Refactor ascend MoE kernel patch logic & Support Qwen3-MoE (#9557) 2025-12-02 00:22:03 +08:00
Username_Full
e43a972b25 [test] add npu test yaml and add ascend a3 docker file (#9547)
Co-authored-by: jiaqiw09 <jiaqiw960714@gmail.com>
2025-11-30 09:37:08 +08:00
Kingsley
22be45c78c [misc] fix omni thinker load (#9552)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-11-30 09:36:36 +08:00
浮梦
d1f585f80a [test] update test cmd (#9544)
Co-authored-by: frozenleaves <frozen@Mac.local>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
2025-11-27 17:59:42 +08:00
xvxuopop
955396e8a5 [example] correct the parameter errors in the examples file. (#9543) 2025-11-27 17:38:38 +08:00
xvxuopop
231756a5bf [chat] fix the error when the vLLM version is greater than 0.10.0 (#9539)
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
2025-11-27 02:14:53 +08:00
xvxuopop
2c4fb3c97e [v1] Support fused moe kernel for qwen3vlmoe model. (#9532) 2025-11-27 02:13:33 +08:00
浮梦
2b6f16f261 [model] temporarily support npu fused options on v0, powered by v1 kernels (#9520)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-27 02:08:36 +08:00
浮梦
f17efde693 [v1] support automatic discovery of registered kernels. (#9509)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-27 01:47:22 +08:00
Hertz
591fc9ed02 [model] support ERNIE-4.5-VL Models (#9521) 2025-11-24 16:48:06 +08:00
Peilin Li
3140c242f0 [assets] add README with KT+llamafactory (#9514) 2025-11-19 16:50:45 +08:00
Peilin Li
887c562d60 [example] Add KTransformers Qwen3MoE example (#9511)
Co-authored-by: unknown <xiongchenhui@hisense.ad>
Co-authored-by: Kingsley <kingsleydodonow@gmail.com>
2025-11-19 00:53:28 +08:00
Edge-Seven
9779b1f361 [misc] fix typos in some files (#9505)
Co-authored-by: khanhkhanhlele <namkhanh20xx@gmail.com>
2025-11-18 20:36:01 +08:00
Yinlei Sun
45f0437a14 [v1] Add support for ShareGPT format. (#9486) 2025-11-18 13:44:08 +08:00
浮梦
d4e120423d [data] fix qwen3omni moe model (#9501)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-18 13:43:22 +08:00
Pory
10a446e373 [model] ktransformers qwen3 support (#9485)
Co-authored-by: unknown <xiongchenhui@hisense.ad>
2025-11-13 20:09:44 +08:00
jiaqiw09
0aa4a051af [test] support slow skip and device skip in Uts (#9484) 2025-11-13 20:08:22 +08:00
Yaowei Zheng
8173a88a26 [assets] update readme (#9477) 2025-11-12 16:15:41 +08:00
Kingsley
fef86fa7fe [data] fix qwen3omni audio length calculation (#9467) 2025-11-12 10:37:15 +08:00
taohongsheng
5afa851f71 [misc] Modify pip install command for huggingface_hub (#9463) 2025-11-10 23:04:00 +08:00
MyungHa Kwon
a711bce664 [data] add openai format (#9449) 2025-11-06 20:10:20 +08:00
魅影
bd24350cbf [v1] add pair data converter (#9360)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-06 14:05:58 +08:00
Peilin Li
bd30c0003b [train] fix denominator of ga in ksft loss (#9409) 2025-11-05 20:53:23 +08:00
魅影
8edd2622ce [docker] update npu dockerfile (#9407)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-05 18:28:32 +08:00
Yaowei Zheng
eaf963f67f [model] update kt code (#9406) 2025-11-05 15:27:22 +08:00
Kingsley
56f45e826f [train] fix MPO re-weight (#9405) 2025-11-04 21:10:41 +08:00
魅影
14abb75126 [model] enable using FA in npu (#9397)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-04 19:32:30 +08:00
한송민
5a9939050e [model] add deepstack_merger_list to Qwen3-VL vision_model_keys (#9399) 2025-11-04 19:27:34 +08:00
Peilin Li
934b3084ee [train] KTransformers SFT as backend engine for LLaMA-Factory (#9400)
Co-authored-by: jimmy128 <jimmy128@noreply.gitcode.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
2025-11-04 15:54:12 +08:00
Yaowei Zheng
3ae15da9c0 [misc] lint code (#9395) 2025-11-03 22:08:59 +08:00
魅影
215580c77d [data] fix mm pluigin for qwen omni video training (#9388)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-03 11:44:27 +08:00
魅影
767b344fb4 [model] remove npu sdpa patch (#9368)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-10-30 16:26:35 +08:00
Kingsley
3057db15c3 [readme] upd mcore readme (#9352) 2025-10-27 21:23:31 +08:00
Kingsley
13170577b2 [feat] support megatron-LM training by mcore_adapter (#9237)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
2025-10-26 16:21:30 +08:00
Xiaosu Zhu
129e918106 [data] Fix Qwen3VL plugin (#9297)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
Co-authored-by: kingsley <kingsleydodonow@gmail.com>
2025-10-26 16:07:04 +08:00
Yaowei Zheng
9c0d033a15 [model] add qwen3vl 2b & 32b (#9343) 2025-10-24 13:22:36 +08:00
Yaowei Zheng
2a822178de [deps] fix yanked packages (#9333) 2025-10-22 20:54:51 +08:00
Kingsley
b842457ef4 [ci] revert mac os ci setup (#9316) 2025-10-21 18:26:12 +08:00
魅影
2c6aded5d4 [v1] kernel plugin (#9274)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-10-18 18:02:14 +08:00
Yaowei Zheng
d9d67ba62d [misc] fix import error (#9299) 2025-10-17 17:46:27 +08:00
Yaowei Zheng
a442fa90ad [misc] fix import error (#9296) 2025-10-17 10:54:30 +08:00
wyfdgg
8c341cbaae [model] support hunyuan-mt model (#9284)
Co-authored-by: wyfdgg <liwenkun0812@163.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
2025-10-17 10:33:09 +08:00
Yaowei Zheng
47a7dc1698 [deps] upgrade vllm (#9293) 2025-10-16 23:20:26 +08:00