423A35C7
  • Joined on 2024-05-11
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-18 22:36:34 +08:00
9779b1f361 [misc] fix typos in some files (#9505)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-18 14:26:35 +08:00
45f0437a14 [v1] Add support for ShareGPT format. (#9486)
d4e120423d [data] fix qwen3omni moe model (#9501)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-14 01:04:36 +08:00
10a446e373 [model] ktransformers qwen3 support (#9485)
0aa4a051af [test] support slow skip and device skip in Uts (#9484)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-12 16:24:33 +08:00
8173a88a26 [assets] update readme (#9477)
fef86fa7fe [data] fix qwen3omni audio length calculation (#9467)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-10 23:34:34 +08:00
5afa851f71 [misc] Modify pip install command for huggingface_hub (#9463)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-07 03:12:13 +08:00
a711bce664 [data] add openai format (#9449)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-06 19:02:17 +08:00
bd24350cbf [v1] add pair data converter (#9360)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-06 02:42:15 +08:00
bd30c0003b [train] fix denominator of ga in ksft loss (#9409)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-05 18:32:14 +08:00
8edd2622ce [docker] update npu dockerfile (#9407)
eaf963f67f [model] update kt code (#9406)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-05 02:12:14 +08:00
56f45e826f [train] fix MPO re-weight (#9405)
14abb75126 [model] enable using FA in npu (#9397)
5a9939050e [model] add deepstack_merger_list to Qwen3-VL vision_model_keys (#9399)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-04 18:02:19 +08:00
934b3084ee [train] KTransformers SFT as backend engine for LLaMA-Factory (#9400)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-04 01:42:14 +08:00
3ae15da9c0 [misc] lint code (#9395)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-11-03 17:32:15 +08:00
215580c77d [data] fix mm pluigin for qwen omni video training (#9388)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-30 23:48:09 +08:00
767b344fb4 [model] remove npu sdpa patch (#9368)
423A35C7 synced commits to main at 423A35C7/pytorch3d from mirror 2025-10-30 23:18:10 +08:00
2d4d345b6f Improve ball_query() runtime for large-scale cases (#2006)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-27 22:18:10 +08:00
3057db15c3 [readme] upd mcore readme (#9352)
423A35C7 synced commits to main at 423A35C7/pytorch3d from mirror 2025-10-27 21:48:10 +08:00
45df20e9e2 clang-format | Format fbsource with clang-format 21.
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-26 21:48:11 +08:00
13170577b2 [feat] support megatron-LM training by mcore_adapter (#9237)
129e918106 [data] Fix Qwen3VL plugin (#9297)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-24 20:48:10 +08:00
9c0d033a15 [model] add qwen3vl 2b & 32b (#9343)
423A35C7 synced commits to main at 423A35C7/LLaMA-Factory from mirror 2025-10-23 03:58:10 +08:00
2a822178de [deps] fix yanked packages (#9333)