LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-07-30 20:56:10 +08:00

Author	SHA1	Message	Date
Vo Van Phuc	5cfd804b59	[refactor] rename lfm template to lfm2 and add LFM 2.5 to README (#9731 )	2026-01-07 19:25:04 +08:00
Vo Van Phuc	958fb523a2	[model] support LiquidAI's LFM2.5-VL vision-language model (#9729 )	2026-01-07 17:20:29 +08:00
Vo Van Phuc	b4e051bea4	[model] support for LiquidAI's LFM2.5 (Liquid Foundation Models) (#9726 )	2026-01-07 14:14:47 +08:00
Yaowei Zheng	d22de0d4bf	[v1] add renderer ut (#9722 )	2026-01-07 02:06:07 +08:00
Yaowei Zheng	ea0b4e2466	[v1] add cli sampler (#9721 )	2026-01-06 23:31:27 +08:00
Yaowei Zheng	f60a6e3d01	[v1] add init plugin (#9716 )	2026-01-04 20:51:46 +08:00
浮梦	1857fbdd6b	[ci] add cuda workflow (#9682 ) Co-authored-by: frozenleaves <frozen@Mac.local> Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>	2025-12-29 20:03:00 +08:00
Copilot	eceec8ab69	[deps] goodbye python 3.9 (#9677 ) Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: hiyouga <16256802+hiyouga@users.noreply.github.com> Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>	2025-12-27 02:50:44 +08:00
Yaowei Zheng	a754604c11	[misc] fix accelerator (#9661 ) Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-12-25 02:11:04 +08:00
ZIYI ZENG	b0d49e137f	[misc] Support split eval_dataset when explict set "predict_with_generate" (#9604 ) Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-12-20 01:46:00 +08:00
浮梦	18c21bce5a	[test] add allreduce test on npu (#9619 ) Co-authored-by: frozenleaves <frozen@Mac.local>	2025-12-16 21:33:30 +08:00
Yaowei Zheng	aeda079014	[v1] model loader (#9613 )	2025-12-14 11:50:52 +08:00
Yaowei Zheng	5d56817e2b	[misc] lint (#9593 ) Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-12-09 18:00:35 +08:00
tangefly	739954910a	[deps] Update for Transformers v5 (#9569 )	2025-12-08 01:13:32 +08:00
Username_Full	e43a972b25	[test] add npu test yaml and add ascend a3 docker file (#9547 ) Co-authored-by: jiaqiw09 <jiaqiw960714@gmail.com>	2025-11-30 09:37:08 +08:00
jiaqiw09	0aa4a051af	[test] support slow skip and device skip in Uts (#9484 )	2025-11-13 20:08:22 +08:00
Yaowei Zheng	af8437095a	[ci] Change macOS version (#9229 )	2025-10-05 02:18:30 +08:00
Yaowei Zheng	6ffebe5ff7	[data] fix qwen omni plugin (#9204 ) Co-authored-by: kingsley <kingsleydodonow@gmail.com>	2025-09-28 01:02:29 +08:00
xvxuopop	0761a4448f	[model] add qwen3-vl/qwen3-omni (#9196 ) Co-authored-by: kingsley <kingsleydodonow@gmail.com>	2025-09-27 01:21:47 +08:00
Kingsley	7e710c6d3e	[misc] update InternVL constants (#9046 )	2025-08-29 13:30:28 +08:00
Yaowei Zheng	4dfad24902	[model] add gpt oss (#8826 )	2025-08-06 05:56:46 +08:00
Yaowei Zheng	4b0ec83928	[deps] bump transformers to 4.49.0 (#8564 )	2025-07-07 20:31:50 +08:00
Yaowei Zheng	906b31fd47	[assets] update readme (#8529 )	2025-07-02 17:42:27 +08:00
Liu Jiajun	4f0da0aec9	[data] fix gemma2 eos token (#8480 ) Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>	2025-06-27 18:19:15 +08:00
Yaowei Zheng	3a3bae1cfe	[data] fix qwen2vl pos ids (#8387 )	2025-06-17 00:48:54 +08:00
Kingsley	212a8006dc	[tests] add visual model save test (#8248 ) Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>	2025-06-05 20:38:01 +08:00
hoshi-hiyouga	ba032828e2	[deps] upgrade transformers (#8159 )	2025-05-26 22:03:58 +08:00
hoshi-hiyouga	9ae17cd173	[deps] update to transformers 4.52 (#8125 )	2025-05-21 05:16:18 +08:00
hoshi-hiyouga	56926d76f9	[data] llama3 multi tool support (#8124 )	2025-05-21 02:01:12 +08:00
hoshi-hiyouga	9b5baa97f0	[data] qwen3 fixes (#8109 )	2025-05-20 02:00:30 +08:00
Saiya	ab41f7956c	[infer] support lora adapter for SGLang backend (#8067 )	2025-05-16 23:33:47 +08:00
hoshi-hiyouga	052ca871bd	[data] optimize qwen3 loss computation (#7923 )	2025-04-30 16:18:00 +08:00
hoshi-hiyouga	98f23c6584	[model] add qwen3 (#7885 )	2025-04-29 09:34:05 +08:00
Kingsley	db9559456c	[data] fix qwen2.5 omni template (#7883 )	2025-04-29 00:58:23 +08:00
Kingsley	fa0eb91f1f	[data] fix internvl plugin (#7817 )	2025-04-23 00:58:22 +08:00
Kingsley	7500e761d3	[misc] update internvl constants (#7801 )	2025-04-22 15:53:08 +08:00
hoshi-hiyouga	b07628dea5	[example] add bash usage (#7794 )	2025-04-22 00:25:51 +08:00
hoshi-hiyouga	416853dd25	[parser] support omegaconf (#7793 )	2025-04-21 23:30:30 +08:00
hoshi-hiyouga	39169986ef	[trainer] fix pt loss (#7748 ) * fix pt loss * robust * fix * test	2025-04-17 03:15:35 +08:00
hoshi-hiyouga	86ebb219d6	[breaking] bump transformers to 4.45.0 & improve ci (#7746 ) * update ci * fix * fix * fix * fix * fix	2025-04-17 02:36:48 +08:00
Kingsley	2e518f255f	[model] support intern-VL 2.5-3 series (#7258 ) * add internvl and rebase * fix for internvl2&3 * remove lines * fix video_inputs & lint * nit * add constants * remove lines * fix * fix error * pass ci * pass ci * skip internvl & nit	2025-04-17 00:31:30 +08:00
hoshi-hiyouga	c3c0efbaa0	[misc] fix packing and eval plot (#7623 )	2025-04-07 18:20:57 +08:00
hoshi-hiyouga	831e7f1cfd	[model] add llama4 (#7611 )	2025-04-06 13:42:31 +08:00
Kingsley	8da1d2fa71	[data] fix pixtral plugin (#7505 ) * preserve `image_sizes` * add comments	2025-03-27 17:06:40 +08:00
hoshi-hiyouga	0583d06676	[model] add qwen2vl 32b & upgrade peft (#7469 ) * add qwen2vl 32b * fix ci * upgrade peft to 0.15 * fix ci * fix ci	2025-03-25 12:15:58 +08:00
hoshi-hiyouga	3aa4f32e9c	[misc] fix ci (#7441 ) * fix ci * improve ci	2025-03-23 21:09:35 +08:00
Qiaolin Yu	a44a53ebec	[inference] support sglang backend (#7278 ) * Mimic SGLang offline Engine * Add more tests and args * Pass all current tests * Clean Code * fix sample_params * clean code * Fix Stream Chat * change sglang from engine mode to server mode * fix * Fix Review Issues * Use SGLang Built-In Utilities * Fix test SGLang * Some Doc Issue * fix sglang engine * add readme --------- Co-authored-by: Jin Pan <jpan236@wisc.edu> Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>	2025-03-15 04:37:58 +08:00
hoshi-hiyouga	93e6184cbe	[data] gemma3 plugin pan and scan (#7294 ) * gemma3 pan and scan * add test case * fix test	2025-03-13 23:29:23 +08:00
hoshi-hiyouga	650a9a9057	[misc] update format (#7277 )	2025-03-13 02:53:08 +08:00
hoshi-hiyouga	264538cb26	[misc] upgrade format to py39 (#7256 )	2025-03-12 00:08:41 +08:00

1 2 3 4

194 Commits