Kingsley
|
a3d44e3152
|
[mca] support qwen3.5 (#10265)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2026-03-10 10:55:16 +08:00 |
|
Hertz
|
c0245c43fc
|
[model] support Qwen3.5 all series models (#10237)
Co-authored-by: gatilin <gatilin@tencent.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2026-03-03 17:34:59 +08:00 |
|
Yaowei Zheng
|
b5cb7cb0e6
|
[misc] fix constants (#10232)
|
2026-03-02 11:10:48 +08:00 |
|
娄宗志
|
589da21d32
|
[model] support Aeva (#10214)
|
2026-02-26 23:03:13 +08:00 |
|
Yaowei Zheng
|
122cd46084
|
[model] update constants (#10220)
|
2026-02-26 21:13:56 +08:00 |
|
浮梦
|
2b8b871475
|
[model] Adapt Qwen3.5 (#10213)
Co-authored-by: frozenleaves <frozen@Mac.local>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2026-02-26 20:45:02 +08:00 |
|
Kingsley
|
a0f3ad0cee
|
[mca] update supported models (#10196)
|
2026-02-20 22:02:49 +08:00 |
|
Xue Yadong
|
d3ebd5678d
|
[model] support GLM-OCR SFT (#10183)
|
2026-02-10 21:41:01 +08:00 |
|
Shanay Mehta
|
ea644d04ec
|
[model] support GLM-4.7-Flash SFT (#10173)
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
|
2026-02-09 10:40:44 +08:00 |
|
Hertz
|
8bedfafa4e
|
[model] support MiniCPM-o-4.5 (#10163)
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2026-02-04 23:21:27 +08:00 |
|
Yaowei Zheng
|
1a02717fa8
|
[assets] update readme (#10159)
|
2026-02-03 19:11:15 +08:00 |
|
ゆり
|
e7cb145f5d
|
[logging] Fix race condition in LoggerHandler during multi-GPU training (#10156)
Co-authored-by: yurekami <yurekami@users.noreply.github.com>
|
2026-02-03 11:14:07 +08:00 |
|
Hertz
|
b53d7037c2
|
[model] support youtu-vl model (#10152)
|
2026-02-02 21:42:43 +08:00 |
|
浮梦
|
bf04ca6af8
|
[deps] adapt to transformers v5 (#10147)
Co-authored-by: frozenleaves <frozen@Mac.local>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
|
2026-02-02 12:07:19 +08:00 |
|
xvxuopop
|
762b480131
|
[feature] support using ray.remote to start distributed training. (#10109)
|
2026-01-28 16:05:29 +08:00 |
|
Kingsley
|
db2f794f7b
|
[misc] update mcore related docker and mca supported models (#10114)
|
2026-01-19 14:55:16 +08:00 |
|
Hertz
|
4d3621e3d3
|
[model] fixed&added Hunyuan models (#9750)
|
2026-01-12 01:15:00 +08:00 |
|
Hertz
|
15b87f3125
|
[model] support HY-MT model (#9746)
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2026-01-11 16:25:56 +08:00 |
|
Yaowei Zheng
|
5cccaeec82
|
[model] clean obsolete models (#9736)
|
2026-01-09 16:12:07 +08:00 |
|
Jackey
|
5fb5d7ebd3
|
[model] support for microsoft's Phi-4-mini (#9734)
|
2026-01-09 12:24:45 +08:00 |
|
Vo Van Phuc
|
5cfd804b59
|
[refactor] rename lfm template to lfm2 and add LFM 2.5 to README (#9731)
|
2026-01-07 19:25:04 +08:00 |
|
Vo Van Phuc
|
958fb523a2
|
[model] support LiquidAI's LFM2.5-VL vision-language model (#9729)
|
2026-01-07 17:20:29 +08:00 |
|
Vo Van Phuc
|
b4e051bea4
|
[model] support for LiquidAI's LFM2.5 (Liquid Foundation Models) (#9726)
|
2026-01-07 14:14:47 +08:00 |
|
Hertz
|
9ae62c6fc0
|
[model] support Youtu-LLM-2B (#9707)
|
2026-01-04 13:17:57 +08:00 |
|
Yaowei Zheng
|
6fe6bd290b
|
[misc] set dev version (#9703)
|
2025-12-31 23:41:40 +08:00 |
|
Yaowei Zheng
|
95ac3f2373
|
[release] Bye 2025 (#9702)
|
2025-12-31 22:22:40 +08:00 |
|
Username_Full
|
000526908a
|
[core deps] upgrade TRL to be between 0.18 and 0.24 (#9617)
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2025-12-31 20:54:27 +08:00 |
|
Kingsley
|
bb1ba31005
|
[misc] lint mca code (#9692)
|
2025-12-29 11:44:38 +08:00 |
|
Hertz
|
c107cc22d0
|
[model] support MiniMax-M1&M2 series (#9680)
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2025-12-28 19:02:05 +08:00 |
|
Copilot
|
eceec8ab69
|
[deps] goodbye python 3.9 (#9677)
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: hiyouga <16256802+hiyouga@users.noreply.github.com>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
|
2025-12-27 02:50:44 +08:00 |
|
Yaowei Zheng
|
55590f5ece
|
[misc] fix ci with uv (#9676)
|
2025-12-27 01:39:13 +08:00 |
|
Yaowei Zheng
|
84485406b7
|
[ci] disable pip cache for ci (#9654)
|
2025-12-23 18:37:40 +08:00 |
|
Yaowei Zheng
|
6ef9854713
|
[misc] fix cache & pin transformers to 4.57.1 (#9638)
|
2025-12-22 00:20:55 +08:00 |
|
Hertz
|
4923f52a28
|
[model] support MiMo-V2-Flash model (#9637)
|
2025-12-21 14:38:18 +08:00 |
|
Hertz
|
9fd4b094d4
|
[model] support VibeThinker models (#9616)
|
2025-12-16 21:50:46 +08:00 |
|
Yaowei Zheng
|
aeda079014
|
[v1] model loader (#9613)
|
2025-12-14 11:50:52 +08:00 |
|
tangefly
|
4fd94141a4
|
[model] Add Ministral3 (#9582)
Co-authored-by: kingsley <kingsleydodonow@gmail.com>
|
2025-12-10 15:57:24 +08:00 |
|
Kingsley
|
22d6ac29d5
|
[model] Rename GLMV template (#9595)
|
2025-12-10 13:27:47 +08:00 |
|
Hertz
|
c1f5f8fff6
|
[model] support GLM4.6v (#9586)
|
2025-12-09 11:06:42 +08:00 |
|
Hertz
|
591fc9ed02
|
[model] support ERNIE-4.5-VL Models (#9521)
|
2025-11-24 16:48:06 +08:00 |
|
Yaowei Zheng
|
eaf963f67f
|
[model] update kt code (#9406)
|
2025-11-05 15:27:22 +08:00 |
|
魅影
|
14abb75126
|
[model] enable using FA in npu (#9397)
Co-authored-by: frozenleaves <frozen@Mac.local>
|
2025-11-04 19:32:30 +08:00 |
|
Peilin Li
|
934b3084ee
|
[train] KTransformers SFT as backend engine for LLaMA-Factory (#9400)
Co-authored-by: jimmy128 <jimmy128@noreply.gitcode.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2025-11-04 15:54:12 +08:00 |
|
Yaowei Zheng
|
3ae15da9c0
|
[misc] lint code (#9395)
|
2025-11-03 22:08:59 +08:00 |
|
Kingsley
|
13170577b2
|
[feat] support megatron-LM training by mcore_adapter (#9237)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2025-10-26 16:21:30 +08:00 |
|
Yaowei Zheng
|
9c0d033a15
|
[model] add qwen3vl 2b & 32b (#9343)
|
2025-10-24 13:22:36 +08:00 |
|
Yaowei Zheng
|
d9d67ba62d
|
[misc] fix import error (#9299)
|
2025-10-17 17:46:27 +08:00 |
|
wyfdgg
|
8c341cbaae
|
[model] support hunyuan-mt model (#9284)
Co-authored-by: wyfdgg <liwenkun0812@163.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2025-10-17 10:33:09 +08:00 |
|
Yaowei Zheng
|
1037f63311
|
[model] add qwen3vl 4b + 8b (#9275)
|
2025-10-15 15:00:36 +08:00 |
|
Yaowei Zheng
|
10146029ba
|
[v1] add v1 launcher (#9236)
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2025-10-07 22:34:48 +08:00 |
|