LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-07-28 03:36:10 +08:00

Author	SHA1	Message	Date
Hertz	4923f52a28	[model] support MiMo-V2-Flash model (#9637 )	2025-12-21 14:38:18 +08:00
Yaowei Zheng	0894b4f37e	[misc] lint (#9636 )	2025-12-20 16:19:39 +08:00
ZIYI ZENG	b0d49e137f	[misc] Support split eval_dataset when explict set "predict_with_generate" (#9604 ) Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-12-20 01:46:00 +08:00
Xunpeng Xiao	ddd7dcc722	[data] Fix the video frame sampling issue #9620 (#9634 )	2025-12-19 18:36:31 +08:00
浮梦	5204cd2bca	[misc] add version check for moe (#9633 )	2025-12-19 14:57:37 +08:00
Xunpeng Xiao	8c74dca76a	[feat] Models trained and inferred with FP8 are dequantized by default (#9627 )	2025-12-18 22:54:35 +08:00
xvxuopop	e8deda53a1	[example] add Qwen3 series examples (#9624 ) Co-authored-by: UsernameFull <tohowtodoit@gmail.com>	2025-12-18 21:27:00 +08:00
mrhaoxx	a769fb94b9	[feat] support ktransformers for dpo (#9621 ) Co-authored-by: poryfly <porykid@gmail.com>	2025-12-18 21:26:25 +08:00
mrhaoxx	964569751f	[kt] refactor ktransformers integration (#9632 )	2025-12-18 21:26:04 +08:00
Hertz	9fd4b094d4	[model] support VibeThinker models (#9616 )	2025-12-16 21:50:46 +08:00
浮梦	18c21bce5a	[test] add allreduce test on npu (#9619 ) Co-authored-by: frozenleaves <frozen@Mac.local>	2025-12-16 21:33:30 +08:00
sunyi0505	a0179772ab	[example] add deepspeed autotp config and example (#9602 )	2025-12-15 15:15:26 +08:00
Yaowei Zheng	aeda079014	[v1] model loader (#9613 )	2025-12-14 11:50:52 +08:00
Xunpeng Xiao	fdd24276ed	[feat] support new function call value (#9610 ) Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>	2025-12-14 00:20:33 +08:00
Yaowei Zheng	110d21713e	[v1] add dp & mp mesh (#9611 )	2025-12-13 01:44:28 +08:00
Yaowei Zheng	203069e11c	[v1] add accelerator (#9607 )	2025-12-12 19:22:06 +08:00
tangefly	4fd94141a4	[model] Add Ministral3 (#9582 ) Co-authored-by: kingsley <kingsleydodonow@gmail.com>	2025-12-10 15:57:24 +08:00
Kingsley	22d6ac29d5	[model] Rename GLMV template (#9595 )	2025-12-10 13:27:47 +08:00
DoubleWheat	cff4483392	[config] Fix RoPE scaling patch for resuming from a scaled model (#9588 )	2025-12-09 20:37:37 +08:00
Yaowei Zheng	5d56817e2b	[misc] lint (#9593 ) Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-12-09 18:00:35 +08:00
Yaowei Zheng	1bbb461f76	[assets] update readme (#9587 )	2025-12-09 12:22:54 +08:00
Hertz	c1f5f8fff6	[model] support GLM4.6v (#9586 )	2025-12-09 11:06:42 +08:00
Yaowei Zheng	5744f1ea94	[v1] add models & accelerator (#9579 )	2025-12-08 02:30:25 +08:00
tangefly	739954910a	[deps] Update for Transformers v5 (#9569 )	2025-12-08 01:13:32 +08:00
xvxuopop	109162dc56	[fix] fix the issue when using fsdp2 with gradient checkpointing. (#9541 ) Co-authored-by: jin-yongxu <jinyongxu@h-partners.com>	2025-12-06 16:04:51 +08:00
jiaqiw09	165f3f073a	[examples] add fsdp config for mutiple nodes (#9575 ) Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>	2025-12-05 23:22:48 +08:00
jiaqiw09	efb13b7483	[V1] Refactor ascend MoE kernel patch logic & Support Qwen3-MoE (#9557 )	2025-12-02 00:22:03 +08:00
Username_Full	e43a972b25	[test] add npu test yaml and add ascend a3 docker file (#9547 ) Co-authored-by: jiaqiw09 <jiaqiw960714@gmail.com>	2025-11-30 09:37:08 +08:00
Kingsley	22be45c78c	[misc] fix omni thinker load (#9552 ) Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-11-30 09:36:36 +08:00
浮梦	d1f585f80a	[test] update test cmd (#9544 ) Co-authored-by: frozenleaves <frozen@Mac.local> Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>	2025-11-27 17:59:42 +08:00
xvxuopop	955396e8a5	[example] correct the parameter errors in the examples file. (#9543 )	2025-11-27 17:38:38 +08:00
xvxuopop	231756a5bf	[chat] fix the error when the vLLM version is greater than 0.10.0 (#9539 ) Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>	2025-11-27 02:14:53 +08:00
xvxuopop	2c4fb3c97e	[v1] Support fused moe kernel for qwen3vlmoe model. (#9532 )	2025-11-27 02:13:33 +08:00
浮梦	2b6f16f261	[model] temporarily support npu fused options on v0, powered by v1 kernels (#9520 ) Co-authored-by: frozenleaves <frozen@Mac.local>	2025-11-27 02:08:36 +08:00
浮梦	f17efde693	[v1] support automatic discovery of registered kernels. (#9509 ) Co-authored-by: frozenleaves <frozen@Mac.local>	2025-11-27 01:47:22 +08:00
Hertz	591fc9ed02	[model] support ERNIE-4.5-VL Models (#9521 )	2025-11-24 16:48:06 +08:00
Peilin Li	3140c242f0	[assets] add README with KT+llamafactory (#9514 )	2025-11-19 16:50:45 +08:00
Peilin Li	887c562d60	[example] Add KTransformers Qwen3MoE example (#9511 ) Co-authored-by: unknown <xiongchenhui@hisense.ad> Co-authored-by: Kingsley <kingsleydodonow@gmail.com>	2025-11-19 00:53:28 +08:00
Edge-Seven	9779b1f361	[misc] fix typos in some files (#9505 ) Co-authored-by: khanhkhanhlele <namkhanh20xx@gmail.com>	2025-11-18 20:36:01 +08:00
Yinlei Sun	45f0437a14	[v1] Add support for ShareGPT format. (#9486 )	2025-11-18 13:44:08 +08:00
浮梦	d4e120423d	[data] fix qwen3omni moe model (#9501 ) Co-authored-by: frozenleaves <frozen@Mac.local>	2025-11-18 13:43:22 +08:00
Pory	10a446e373	[model] ktransformers qwen3 support (#9485 ) Co-authored-by: unknown <xiongchenhui@hisense.ad>	2025-11-13 20:09:44 +08:00
jiaqiw09	0aa4a051af	[test] support slow skip and device skip in Uts (#9484 )	2025-11-13 20:08:22 +08:00
Yaowei Zheng	8173a88a26	[assets] update readme (#9477 )	2025-11-12 16:15:41 +08:00
Kingsley	fef86fa7fe	[data] fix qwen3omni audio length calculation (#9467 )	2025-11-12 10:37:15 +08:00
taohongsheng	5afa851f71	[misc] Modify pip install command for huggingface_hub (#9463 )	2025-11-10 23:04:00 +08:00
MyungHa Kwon	a711bce664	[data] add openai format (#9449 )	2025-11-06 20:10:20 +08:00
魅影	bd24350cbf	[v1] add pair data converter (#9360 ) Co-authored-by: frozenleaves <frozen@Mac.local>	2025-11-06 14:05:58 +08:00
Peilin Li	bd30c0003b	[train] fix denominator of ga in ksft loss (#9409 )	2025-11-05 20:53:23 +08:00
魅影	8edd2622ce	[docker] update npu dockerfile (#9407 ) Co-authored-by: frozenleaves <frozen@Mac.local>	2025-11-05 18:28:32 +08:00

... 3 4 5 6 7 ...

3077 Commits