Commit Graph

  • 9829ae0a77 [ci] using mp to run kernel test (#9754) main 浮梦 2026-01-13 19:43:59 +08:00
  • 958b9c3468 [v1] add sft (#9752) Yaowei Zheng 2026-01-12 03:15:01 +08:00
  • 4d3621e3d3 [model] fixed&added Hunyuan models (#9750) Hertz 2026-01-12 01:15:00 +08:00
  • a296723697 [v1] upgrade batching (#9751) Yaowei Zheng 2026-01-12 00:21:36 +08:00
  • 15b87f3125 [model] support HY-MT model (#9746) Hertz 2026-01-11 16:25:56 +08:00
  • 9f73a6eb23 [deps] fix package (#9745) Yaowei Zheng 2026-01-10 04:27:53 +08:00
  • b2effbd77c [v1] add batch generator (#9744) Yaowei Zheng 2026-01-10 04:24:09 +08:00
  • d7d734d54c [misc] fix fp8 (#9742) Yaowei Zheng 2026-01-09 16:17:26 +08:00
  • 8abb8fb533 [v1] use async streamer (#9741) Yaowei Zheng 2026-01-09 16:07:40 +08:00
  • 766d5ae6ad [ci] fix workflow (#9738) Yaowei Zheng 2026-01-09 14:48:16 +08:00
  • 5cccaeec82 [model] clean obsolete models (#9736) Yaowei Zheng 2026-01-09 14:08:18 +08:00
  • 5fb5d7ebd3 [model] support for microsoft's Phi-4-mini (#9734) Jackey 2026-01-09 12:24:45 +08:00
  • 03a70ba8dd [fix] correct ktransformers example config paths and templates (#9732) Peilin Li 2026-01-08 10:52:50 +08:00
  • 5cfd804b59 [refactor] rename lfm template to lfm2 and add LFM 2.5 to README (#9731) Vo Van Phuc 2026-01-07 18:25:04 +07:00
  • 4c1eb922e2 [misc] fix parser (#9730) Yaowei Zheng 2026-01-07 17:36:08 +08:00
  • 958fb523a2 [model] support LiquidAI's LFM2.5-VL vision-language model (#9729) Vo Van Phuc 2026-01-07 16:20:29 +07:00
  • b4e051bea4 [model] support for LiquidAI's LFM2.5 (Liquid Foundation Models) (#9726) Vo Van Phuc 2026-01-07 13:14:47 +07:00
  • d43e1007e8 [ci] improve cuda ci cache (#9725) 浮梦 2026-01-07 12:34:40 +08:00
  • f89d9367e5 [assets] update README.md (#9724) Xunpeng Xiao 2026-01-07 12:11:50 +08:00
  • d22de0d4bf [v1] add renderer ut (#9722) Yaowei Zheng 2026-01-07 02:06:07 +08:00
  • ea0b4e2466 [v1] add cli sampler (#9721) Yaowei Zheng 2026-01-06 23:31:27 +08:00
  • e944dc442c [feature] add support for EAFT loss (#9720) yanglele 2026-01-06 23:07:12 +08:00
  • 68119e5522 [misc] Add a PyTorch version warning for Conv3D. (#9715) Xunpeng Xiao 2026-01-05 13:26:29 +08:00
  • f60a6e3d01 [v1] add init plugin (#9716) Yaowei Zheng 2026-01-04 20:51:46 +08:00
  • 81b8a50aa5 [deps] Update pyproject.toml and requirements (#9714) jiaqiw09 2026-01-04 19:52:16 +08:00
  • 8600530002 [misc] lint (#9710) Yaowei Zheng 2026-01-04 13:47:56 +08:00
  • 9ae62c6fc0 [model] support Youtu-LLM-2B (#9707) Hertz 2026-01-04 13:17:57 +08:00
  • 0087bc253b [misc] Compatible with an empty architectures field in config.json (#9709) Xunpeng Xiao 2026-01-04 12:11:35 +08:00
  • 355d5c5e5a [fix] fp8: add Transformer Engine backend support (#9705) Santosh Bhavani 2025-12-31 18:18:02 -08:00
  • 6fe6bd290b [misc] set dev version (#9703) Yaowei Zheng 2025-12-31 23:41:40 +08:00
  • 95ac3f2373 [release] Bye 2025 (#9702) v0.9.4 Yaowei Zheng 2025-12-31 22:22:40 +08:00
  • 000526908a [core deps] upgrade TRL to be between 0.18 and 0.24 (#9617) Username_Full 2025-12-31 20:54:27 +08:00
  • c8d7e85b3e [fix] Fix prediction metrics in scripts/vllm_infer.py to match Transformers (#9701) fivehaitao 2025-12-31 18:30:00 +08:00
  • 16735b9e35 [v1] Refactor kernel plugin (#9669) 浮梦 2025-12-31 18:26:48 +08:00
  • 4e1d69579a [data] add DLR-Web dataset for supervised fine-tuning (#9696) Weize Liu 2025-12-30 07:50:38 -05:00
  • 1857fbdd6b [ci] add cuda workflow (#9682) 浮梦 2025-12-29 20:03:00 +08:00
  • bb1ba31005 [misc] lint mca code (#9692) Kingsley 2025-12-29 11:44:38 +08:00
  • e97d0474fb [ci] Fix NPU device condition in docker workflow (#9688) Copilot 2025-12-28 20:04:59 +08:00
  • 3f0c3dc84d [assets] fix installation (#9687) Yaowei Zheng 2025-12-28 19:29:28 +08:00
  • c107cc22d0 [model] support MiniMax-M1&M2 series (#9680) Hertz 2025-12-28 19:02:05 +08:00
  • 7ef1fba34a [version] fix gradio (#9685) Yaowei Zheng 2025-12-28 05:00:51 +08:00
  • eceec8ab69 [deps] goodbye python 3.9 (#9677) Copilot 2025-12-27 02:50:44 +08:00
  • b44f651e09 [ci] fix docker (#9678) Yaowei Zheng 2025-12-27 02:43:46 +08:00
  • 55590f5ece [misc] fix ci with uv (#9676) Yaowei Zheng 2025-12-27 01:39:13 +08:00
  • a1b1931b4a [breaking] migrate from setuptools to uv (#9673) Copilot 2025-12-26 22:47:23 +08:00
  • 3c17f2722c [model] Update ernie_vl to adapt new version (#9665) Xunpeng Xiao 2025-12-26 19:57:49 +08:00
  • a882e2d5fc [assets] Add GitHub Copilot instructions for repository (#9675) Copilot 2025-12-26 17:32:48 +08:00
  • a754604c11 [misc] fix accelerator (#9661) Yaowei Zheng 2025-12-25 02:11:04 +08:00
  • 6a2eafbae3 [feat] Models trained and inferred with Mxfp4 are dequantized by default (#9652) Xunpeng Xiao 2025-12-24 00:26:40 +08:00
  • 84485406b7 [ci] disable pip cache for ci (#9654) Yaowei Zheng 2025-12-23 18:37:40 +08:00
  • 1c8a42d2f8 [v1&WIP] dataloader init (#9645) Kingsley 2025-12-23 16:29:47 +08:00
  • 7901b2f32e [model] efficient tuning for gpt-oss (#9354) thulyubh22 2025-12-23 16:28:38 +08:00
  • 1f1f5a7d1b [ci] remove docker cache (#9640) Yaowei Zheng 2025-12-22 01:03:10 +08:00
  • 6ef9854713 [misc] fix cache & pin transformers to 4.57.1 (#9638) Yaowei Zheng 2025-12-22 00:20:55 +08:00
  • 4923f52a28 [model] support MiMo-V2-Flash model (#9637) Hertz 2025-12-21 14:38:18 +08:00
  • 0894b4f37e [misc] lint (#9636) Yaowei Zheng 2025-12-20 16:19:39 +08:00
  • b0d49e137f [misc] Support split eval_dataset when explict set "predict_with_generate" (#9604) ZIYI ZENG 2025-12-20 01:46:00 +08:00
  • ddd7dcc722 [data] Fix the video frame sampling issue #9620 (#9634) Xunpeng Xiao 2025-12-19 18:36:31 +08:00
  • 5204cd2bca [misc] add version check for moe (#9633) 浮梦 2025-12-19 14:57:37 +08:00
  • 8c74dca76a [feat] Models trained and inferred with FP8 are dequantized by default (#9627) Xunpeng Xiao 2025-12-18 22:54:35 +08:00
  • e8deda53a1 [example] add Qwen3 series examples (#9624) xvxuopop 2025-12-18 21:27:00 +08:00
  • a769fb94b9 [feat] support ktransformers for dpo (#9621) mrhaoxx 2025-12-18 21:26:25 +08:00
  • 964569751f [kt] refactor ktransformers integration (#9632) mrhaoxx 2025-12-18 21:26:04 +08:00
  • 9fd4b094d4 [model] support VibeThinker models (#9616) Hertz 2025-12-16 21:50:46 +08:00
  • 18c21bce5a [test] add allreduce test on npu (#9619) 浮梦 2025-12-16 21:33:30 +08:00
  • a0179772ab [example] add deepspeed autotp config and example (#9602) sunyi0505 2025-12-15 15:15:26 +08:00
  • aeda079014 [v1] model loader (#9613) Yaowei Zheng 2025-12-14 11:50:52 +08:00
  • fdd24276ed [feat] support new function call value (#9610) Xunpeng Xiao 2025-12-14 00:20:33 +08:00
  • 110d21713e [v1] add dp & mp mesh (#9611) Yaowei Zheng 2025-12-13 01:44:28 +08:00
  • 203069e11c [v1] add accelerator (#9607) Yaowei Zheng 2025-12-12 19:22:06 +08:00
  • 4fd94141a4 [model] Add Ministral3 (#9582) tangefly 2025-12-10 15:57:24 +08:00
  • 22d6ac29d5 [model] Rename GLMV template (#9595) Kingsley 2025-12-10 13:27:47 +08:00
  • cff4483392 [config] Fix RoPE scaling patch for resuming from a scaled model (#9588) DoubleWheat 2025-12-09 20:37:37 +08:00
  • 5d56817e2b [misc] lint (#9593) Yaowei Zheng 2025-12-09 18:00:35 +08:00
  • 1bbb461f76 [assets] update readme (#9587) Yaowei Zheng 2025-12-09 12:22:54 +08:00
  • c1f5f8fff6 [model] support GLM4.6v (#9586) Hertz 2025-12-09 11:06:42 +08:00
  • 5744f1ea94 [v1] add models & accelerator (#9579) Yaowei Zheng 2025-12-08 02:30:25 +08:00
  • 739954910a [deps] Update for Transformers v5 (#9569) tangefly 2025-12-08 01:13:32 +08:00
  • 109162dc56 [fix] fix the issue when using fsdp2 with gradient checkpointing. (#9541) xvxuopop 2025-12-06 16:04:51 +08:00
  • 165f3f073a [examples] add fsdp config for mutiple nodes (#9575) jiaqiw09 2025-12-05 23:22:48 +08:00
  • efb13b7483 [V1] Refactor ascend MoE kernel patch logic & Support Qwen3-MoE (#9557) jiaqiw09 2025-12-02 00:22:03 +08:00
  • e43a972b25 [test] add npu test yaml and add ascend a3 docker file (#9547) Username_Full 2025-11-30 09:37:08 +08:00
  • 22be45c78c [misc] fix omni thinker load (#9552) Kingsley 2025-11-30 09:36:36 +08:00
  • d1f585f80a [test] update test cmd (#9544) 浮梦 2025-11-27 17:59:42 +08:00
  • 955396e8a5 [example] correct the parameter errors in the examples file. (#9543) xvxuopop 2025-11-27 17:38:38 +08:00
  • 231756a5bf [chat] fix the error when the vLLM version is greater than 0.10.0 (#9539) xvxuopop 2025-11-27 02:14:53 +08:00
  • 2c4fb3c97e [v1] Support fused moe kernel for qwen3vlmoe model. (#9532) xvxuopop 2025-11-27 02:13:33 +08:00
  • 2b6f16f261 [model] temporarily support npu fused options on v0, powered by v1 kernels (#9520) 浮梦 2025-11-27 02:08:36 +08:00
  • f17efde693 [v1] support automatic discovery of registered kernels. (#9509) 浮梦 2025-11-27 01:47:22 +08:00
  • 591fc9ed02 [model] support ERNIE-4.5-VL Models (#9521) Hertz 2025-11-24 16:48:06 +08:00
  • 3140c242f0 [assets] add README with KT+llamafactory (#9514) Peilin Li 2025-11-19 16:50:45 +08:00
  • 887c562d60 [example] Add KTransformers Qwen3MoE example (#9511) Peilin Li 2025-11-19 00:53:28 +08:00
  • 9779b1f361 [misc] fix typos in some files (#9505) Edge-Seven 2025-11-18 19:36:01 +07:00
  • 45f0437a14 [v1] Add support for ShareGPT format. (#9486) Yinlei Sun 2025-11-18 13:44:08 +08:00
  • d4e120423d [data] fix qwen3omni moe model (#9501) 浮梦 2025-11-18 13:43:22 +08:00
  • 10a446e373 [model] ktransformers qwen3 support (#9485) Pory 2025-11-13 20:09:44 +08:00
  • 0aa4a051af [test] support slow skip and device skip in Uts (#9484) jiaqiw09 2025-11-13 20:08:22 +08:00
  • 8173a88a26 [assets] update readme (#9477) Yaowei Zheng 2025-11-12 16:15:41 +08:00
  • fef86fa7fe [data] fix qwen3omni audio length calculation (#9467) Kingsley 2025-11-12 10:37:15 +08:00
  • 5afa851f71 [misc] Modify pip install command for huggingface_hub (#9463) taohongsheng 2025-11-10 23:04:00 +08:00