2845 Commits

Author SHA1 Message Date
xvxuopop
2c4fb3c97e
[v1] Support fused moe kernel for qwen3vlmoe model. (#9532) 2025-11-27 02:13:33 +08:00
浮梦
2b6f16f261
[model] temporarily support npu fused options on v0, powered by v1 kernels (#9520)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-27 02:08:36 +08:00
浮梦
f17efde693
[v1] support automatic discovery of registered kernels. (#9509)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-27 01:47:22 +08:00
Hertz
591fc9ed02
[model] support ERNIE-4.5-VL Models (#9521) 2025-11-24 16:48:06 +08:00
Peilin Li
3140c242f0
[assets] add README with KT+llamafactory (#9514) 2025-11-19 16:50:45 +08:00
Peilin Li
887c562d60
[example] Add KTransformers Qwen3MoE example (#9511)
Co-authored-by: unknown <xiongchenhui@hisense.ad>
Co-authored-by: Kingsley <kingsleydodonow@gmail.com>
2025-11-19 00:53:28 +08:00
Edge-Seven
9779b1f361
[misc] fix typos in some files (#9505)
Co-authored-by: khanhkhanhlele <namkhanh20xx@gmail.com>
2025-11-18 20:36:01 +08:00
Yinlei Sun
45f0437a14
[v1] Add support for ShareGPT format. (#9486) 2025-11-18 13:44:08 +08:00
浮梦
d4e120423d
[data] fix qwen3omni moe model (#9501)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-18 13:43:22 +08:00
Pory
10a446e373
[model] ktransformers qwen3 support (#9485)
Co-authored-by: unknown <xiongchenhui@hisense.ad>
2025-11-13 20:09:44 +08:00
jiaqiw09
0aa4a051af
[test] support slow skip and device skip in UTs (#9484) 2025-11-13 20:08:22 +08:00
Yaowei Zheng
8173a88a26
[assets] update readme (#9477) 2025-11-12 16:15:41 +08:00
Kingsley
fef86fa7fe
[data] fix qwen3omni audio length calculation (#9467) 2025-11-12 10:37:15 +08:00
taohongsheng
5afa851f71
[misc] Modify pip install command for huggingface_hub (#9463) 2025-11-10 23:04:00 +08:00
MyungHa Kwon
a711bce664
[data] add openai format (#9449) 2025-11-06 20:10:20 +08:00
魅影
bd24350cbf
[v1] add pair data converter (#9360)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-06 14:05:58 +08:00
Peilin Li
bd30c0003b
[train] fix denominator of ga in ksft loss (#9409) 2025-11-05 20:53:23 +08:00
魅影
8edd2622ce
[docker] update npu dockerfile (#9407)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-05 18:28:32 +08:00
Yaowei Zheng
eaf963f67f
[model] update kt code (#9406) 2025-11-05 15:27:22 +08:00
Kingsley
56f45e826f
[train] fix MPO re-weight (#9405) 2025-11-04 21:10:41 +08:00
魅影
14abb75126
[model] enable using FA in npu (#9397)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-04 19:32:30 +08:00
한송민
5a9939050e
[model] add deepstack_merger_list to Qwen3-VL vision_model_keys (#9399) 2025-11-04 19:27:34 +08:00
Peilin Li
934b3084ee
[train] KTransformers SFT as backend engine for LLaMA-Factory (#9400)
Co-authored-by: jimmy128 <jimmy128@noreply.gitcode.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
2025-11-04 15:54:12 +08:00
Yaowei Zheng
3ae15da9c0
[misc] lint code (#9395) 2025-11-03 22:08:59 +08:00
魅影
215580c77d
[data] fix mm plugin for qwen omni video training (#9388)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-11-03 11:44:27 +08:00
魅影
767b344fb4
[model] remove npu sdpa patch (#9368)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-10-30 16:26:35 +08:00
Kingsley
3057db15c3
[readme] upd mcore readme (#9352) 2025-10-27 21:23:31 +08:00
Kingsley
13170577b2
[feat] support megatron-LM training by mcore_adapter (#9237)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
2025-10-26 16:21:30 +08:00
Xiaosu Zhu
129e918106
[data] Fix Qwen3VL plugin (#9297)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
Co-authored-by: kingsley <kingsleydodonow@gmail.com>
2025-10-26 16:07:04 +08:00
Yaowei Zheng
9c0d033a15
[model] add qwen3vl 2b & 32b (#9343) 2025-10-24 13:22:36 +08:00
Yaowei Zheng
2a822178de
[deps] fix yanked packages (#9333) 2025-10-22 20:54:51 +08:00
Kingsley
b842457ef4
[ci] revert mac os ci setup (#9316) 2025-10-21 18:26:12 +08:00
魅影
2c6aded5d4
[v1] kernel plugin (#9274)
Co-authored-by: frozenleaves <frozen@Mac.local>
2025-10-18 18:02:14 +08:00
Yaowei Zheng
d9d67ba62d
[misc] fix import error (#9299) 2025-10-17 17:46:27 +08:00
Yaowei Zheng
a442fa90ad
[misc] fix import error (#9296) 2025-10-17 10:54:30 +08:00
wyfdgg
8c341cbaae
[model] support hunyuan-mt model (#9284)
Co-authored-by: wyfdgg <liwenkun0812@163.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
2025-10-17 10:33:09 +08:00
Yaowei Zheng
47a7dc1698
[deps] upgrade vllm (#9293) 2025-10-16 23:20:26 +08:00
Yaowei Zheng
1037f63311
[model] add qwen3vl 4b + 8b (#9275) 2025-10-15 15:00:36 +08:00
Ximing Xing
c867e28093
[model] adds semantic initialization support for special tokens (#9267)
Co-authored-by: ximingxing <ximingxing@tencent.com>
2025-10-14 17:00:48 +08:00
Peter-Hamster
3dbca4b533
[data] add new reason tool calls demo data (#9249)
Co-authored-by: unknown <Peter Zeng>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
2025-10-13 17:16:47 +08:00
Yaowei Zheng
9d1acbc191
[ci] fix ci (#9265) 2025-10-13 16:24:40 +08:00
Yaowei Zheng
52e46e162e
[v1] add data converter (#9263) 2025-10-13 15:54:47 +08:00
Jiayi Mao
48974783da
[model]: add ernie4_5_moe support for DeepSpeed Zero3 training (#9262) 2025-10-13 13:13:31 +08:00
Yaowei Zheng
575e4099df
[misc] add qwen bench script (#9259)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-10-13 11:45:25 +08:00
Yaowei Zheng
9687b71d3a
[v1] init data plugins (#9248)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-10-09 22:36:48 +08:00
Yaowei Zheng
1c35db60d6
[v1] support read dataset (#9243) 2025-10-09 17:16:33 +08:00
Yaowei Zheng
10146029ba
[v1] add v1 launcher (#9236)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-10-07 22:34:48 +08:00
Wu Wenhao
95b7188090
Merge commit from fork
* fix lfi and ssrf

* move utils to common

---------

Co-authored-by: d3do <chamlinx@outlook.com>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-10-07 20:55:29 +08:00
Yaowei Zheng
d5bb4e6394
[assets] update readme (#9232) 2025-10-05 16:42:19 +08:00
Yaowei Zheng
3fe6f0febd
[ci] update docker workflow (#9231) 2025-10-05 02:50:55 +08:00