423A35C7
  • Joined on 2024-05-11
423A35C7 synced commits to main at 423A35C7/pytorch3d from mirror 2025-11-28 03:24:19 +08:00
33824be3cb version 0.7.9
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-27 18:54:19 +08:00
d1f585f80a [test] update test cmd (#9544)
955396e8a5 [example] correct the parameter errors in the examples file. (#9543)
Compare 2 commits »
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-27 02:34:19 +08:00
231756a5bf [chat] fix the error when the vLLM version is greater than 0.10.0 (#9539)
2c4fb3c97e [v1] Support fused moe kernel for qwen3vlmoe model. (#9532)
2b6f16f261 [model] temporarily support npu fused options on v0, powered by v1 kernels (#9520)
f17efde693 [v1] support automatic discovery of registered kernels. (#9509)
Compare 4 commits »
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-24 17:24:18 +08:00
591fc9ed02 [model] support ERNIE-4.5-VL Models (#9521)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-19 23:04:22 +08:00
3140c242f0 [assets] add README with KT+llamafactory (#9514)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-19 06:46:35 +08:00
887c562d60 [example] Add KTransformers Qwen3MoE example (#9511)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-18 22:36:34 +08:00
9779b1f361 [misc] fix typos in some files (#9505)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-18 14:26:35 +08:00
45f0437a14 [v1] Add support for ShareGPT format. (#9486)
d4e120423d [data] fix qwen3omni moe model (#9501)
Compare 2 commits »
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-14 01:04:36 +08:00
10a446e373 [model] ktransformers qwen3 support (#9485)
0aa4a051af [test] support slow skip and device skip in Uts (#9484)
Compare 2 commits »
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-12 16:24:33 +08:00
8173a88a26 [assets] update readme (#9477)
fef86fa7fe [data] fix qwen3omni audio length calculation (#9467)
Compare 2 commits »
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-10 23:34:34 +08:00
5afa851f71 [misc] Modify pip install command for huggingface_hub (#9463)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-07 03:12:13 +08:00
a711bce664 [data] add openai format (#9449)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-06 19:02:17 +08:00
bd24350cbf [v1] add pair data converter (#9360)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-06 02:42:15 +08:00
bd30c0003b [train] fix denominator of ga in ksft loss (#9409)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-05 18:32:14 +08:00
8edd2622ce [docker] update npu dockerfile (#9407)
eaf963f67f [model] update kt code (#9406)
Compare 2 commits »
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-05 02:12:14 +08:00
56f45e826f [train] fix MPO re-weight (#9405)
14abb75126 [model] enable using FA in npu (#9397)
5a9939050e [model] add deepstack_merger_list to Qwen3-VL vision_model_keys (#9399)
Compare 3 commits »
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-04 18:02:19 +08:00
934b3084ee [train] KTransformers SFT as backend engine for LLaMA-Factory (#9400)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-04 01:42:14 +08:00
3ae15da9c0 [misc] lint code (#9395)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-03 17:32:15 +08:00
215580c77d [data] fix mm pluigin for qwen omni video training (#9388)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-30 23:48:09 +08:00
767b344fb4 [model] remove npu sdpa patch (#9368)