LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2025-12-29 10:10:35 +08:00

Author	SHA1	Message	Date
Yaowei Zheng	6f743571b1	[misc] move wechat out (#9223 )	2025-10-02 02:06:09 +08:00
Ben Feuer	29baefbc1a	[feat] fp8 training (#8960 ) Co-authored-by: Benjamin Feuer <penfever@gmail.com> Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>	2025-10-01 14:32:53 +08:00
Zeju Qiu	8efebab098	[feature] adding orthogononal finetuning (OFT) to llama factory (#8623 ) Co-authored-by: Zeju <zqiu@g003.internal.cluster.is.localnet> Co-authored-by: Zeju <zqiu@login2.is.localnet> Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>	2025-08-18 18:22:47 +08:00
XLXW	3cff2fd946	[feature] add support for dft loss (#8917 )	2025-08-15 23:29:57 +08:00
Yaowei Zheng	387454e524	[model] add gpt oss (#8826 )	2025-08-06 05:56:46 +08:00
Butui Hu	63a3d474b8	[launcher] Add elastic and fault-tolerant training support (#8286 ) Signed-off-by: Butui Hu <hot123tea123@gmail.com>	2025-06-05 16:40:03 +08:00
hoshi-hiyouga	8357d451b2	[example] update examples (#7964 )	2025-05-06 17:24:25 +02:00
hoshi-hiyouga	1250ff9575	[misc] fix uv (#7913 )	2025-04-30 07:45:03 +08:00
hoshi-hiyouga	cea9071ed1	[example] add bash usage (#7794 )	2025-04-22 00:25:51 +08:00
Juanxi Tian	5a02c5afc2	[trainer] Add Muon Optimizer (#7749 ) Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>	2025-04-21 23:38:37 +08:00
hoshi-hiyouga	3fc56b8499	[parser] support omegaconf (#7793 )	2025-04-21 23:30:30 +08:00
hoshi-hiyouga	06001ea2f0	[infer] set env for vllm ascend (#7745 )	2025-04-17 01:08:55 +08:00
leo-pony	98e6b3c0ca	[infer] support vllm-ascend (#7739 )	2025-04-16 20:06:47 +08:00
Eric Tang	5cc0d6a8f0	[ray] allow for specifying ray.init kwargs (i.e. runtime_env) (#7647 ) * ray init kwargs * Update trainer_utils.py * fix ray args --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>	2025-04-10 11:31:05 +08:00
hoshi-hiyouga	fb46193364	[misc] fix packing and eval plot (#7623 )	2025-04-07 18:20:57 +08:00
hoshi-hiyouga	c0079ab9fd	[assets] update readme (#7612 )	2025-04-06 13:58:49 +08:00
hoshi-hiyouga	37d783149d	[model] fix kv cache (#7564 )	2025-04-01 23:07:46 +08:00
Qiaolin Yu	280d9bda76	[inference] support sglang backend (#7278 ) * Mimic SGLang offline Engine * Add more tests and args * Pass all current tests * Clean Code * fix sample_params * clean code * Fix Stream Chat * change sglang from engine mode to server mode * fix * Fix Review Issues * Use SGLang Built-In Utilities * Fix test SGLang * Some Doc Issue * fix sglang engine * add readme --------- Co-authored-by: Jin Pan <jpan236@wisc.edu> Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>	2025-03-15 04:37:58 +08:00
hoshi-hiyouga	c6331546a9	[config] update args (#7231 ) Former-commit-id: f71a901840811bf560df671ec63a146ff99140c6	2025-03-10 23:04:43 +08:00
hoshi-hiyouga	678c65fc28	[misc] fix ds config (#7205 ) Former-commit-id: b478fa1d9de1858075769f86f57126fde92db813	2025-03-07 15:21:28 +08:00
hoshi-hiyouga	54bcc37f55	[trainer] update config (#7174 ) Former-commit-id: 9f535d0e3c4ee3cd0f1b65218c2eee5d03f43c6f	2025-03-05 23:32:54 +08:00
hoshi-hiyouga	a354df6d90	[model] add models (#7054 ) * add qwen25vl awq models * add moonlight Former-commit-id: ae3be2970fea8a35907202a313ab767381c44916	2025-02-24 22:05:13 +08:00
hoshi-hiyouga	5deefc6094	[data] update vlm args (#6976 ) Former-commit-id: c28e710636a0286d4b8a1d494529b25168a8f3ab	2025-02-18 02:12:51 +08:00
hoshi-hiyouga	b7ccfd28d1	[data] add min resolution option (#6975 ) Former-commit-id: 76bd9a98a2fb00f1a1d881e6e1364c02fd36d327	2025-02-18 01:40:46 +08:00
hoshi-hiyouga	98b20233ae	[misc] update readme (#6917 ) Former-commit-id: 6bbed1d8c4189fb7bea40230e278c40bb5336fbd	2025-02-13 00:58:10 +08:00
Eric Tang	24ad208345	[example] fix path to ray example (#6906 ) Former-commit-id: e9bee3ef045d85051da04e6ad581a23a9e1a9551	2025-02-13 00:29:32 +08:00
hoshi-hiyouga	c5649d7149	[data] fix ollama template (#6902 ) * fix ollama template * add meta info * use half precision Former-commit-id: 1304bbea69d8c8ca57140017515dee7ae2ee6536	2025-02-11 22:43:09 +08:00
hoshi-hiyouga	ca5cd8276c	[misc] support export ollama modelfile (#6899 ) * support export ollama modelfile * update config * add system and num ctx Former-commit-id: 8c2af7466f4015f300b51841db11bcd2505ebf20	2025-02-11 19:52:25 +08:00
hoshi-hiyouga	01fa59bb6b	disable valset by default (#6690 ) Former-commit-id: a1a94f364e33d1d73852f74eda4fa581e6b16533	2025-01-17 21:09:30 +08:00
hoshi-hiyouga	33d420bbcc	[optim] clean apollo (#6645 ) * clean apollo code * update readme Former-commit-id: 38b8ec4a99189483124b54df9d6bc6b0d318855a	2025-01-15 01:42:50 +08:00
zhuHQ	9b29a431db	[optim] add support to APOLLO (#6617 ) Former-commit-id: 5a252e5a458457adbd19da3b68a3897ad2962824	2025-01-15 00:24:56 +08:00
hoshi-hiyouga	7ab274eb67	[inference] fix stop token for object detection (#6624 ) * fix stop token * update minicpm data pipeline * fix npu qlora examples Former-commit-id: 844919fadaa8a61dfae47020971ea80730b2346f	2025-01-13 21:34:20 +08:00
codingma	6def336d82	add nf4 qlora support on Ascend NPU (#6601 ) * add nf4 qlora support on Ascend NPU * add transformers version check * add python>=3.10 requirement description for npu * tiny fix --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 7912d1acac5f10dab22145fe729a90c57aad8d85	2025-01-13 19:43:36 +08:00
hiyouga	e49c021e22	refactor mllm param logic Former-commit-id: b895c190945cf5d991cb4e4dea2ae73cc9c8d246	2025-01-10 15:45:48 +00:00
hiyouga	708e899769	refactor ray integration, support save ckpt Former-commit-id: 2f50b27e608b2092bfceab6c6e84e6631e973ee2	2025-01-07 09:39:10 +00:00
Eric Tang	88e9badcbb	run style check Former-commit-id: 5ec33baf5f95df9fa2afe5523c825d3eda8a076b	2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi	09a17b5415	drafting ray integration Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com> Former-commit-id: 19c12ddae9350f6e25a270fe3372f5b9094cf960	2025-01-07 08:55:44 +00:00
Yaser Afshar	eee39dd138	Add missing key to init_kwargs Former-commit-id: 03fc4621dad132164596a58d3e8693787b7e1aca	2024-12-17 12:34:05 +00:00
Yaser Afshar	45596f5ae0	Add trust_remote_code parameter and remove True - Introduced a new model parameter `trust_remote_code` - Set the default value of `trust_remote_code` to `False` to enhance security Former-commit-id: 4bf23f406cf5235c16f9f8139850c53354901814	2024-12-17 12:25:12 +00:00
hiyouga	ea1a17c46c	update assets Former-commit-id: 7b9bd552b2bf97b72976511094eb51dfde5d1017	2024-12-14 17:36:03 +00:00
hiyouga	b57560aa25	fix mrope Former-commit-id: 55bee1d333549ca19858b3f5c1b7b86926e5fb09	2024-12-12 15:08:17 +00:00
hiyouga	7a7631134f	support qwen2vl train proj only Former-commit-id: 0e949ef03455726e907c6f1039e93ebe480c897a	2024-12-05 10:37:42 +00:00
hiyouga	dc6b9c104f	update examples Former-commit-id: bcb010be7732ae137f156932100ee4d02a93725c	2024-12-05 08:48:25 +00:00
hiyouga	51b18e565d	support batch infer in vllm Former-commit-id: 3ef5ed3b9a44eed2f7e3ff221dfc343d0a97c0b5	2024-12-04 13:50:00 +00:00
hiyouga	85343ddf47	add vllm config Former-commit-id: 95365f0ce4f362bde7de8b679b54b548d7055bfb	2024-11-10 21:28:18 +08:00
hiyouga	3c503cec89	update tests Former-commit-id: 4e92b656e324725048d914946e70867be20032ff	2024-11-02 12:41:44 +08:00
hiyouga	dbbfb5f5dc	use pre-commit Former-commit-id: 7cfede95df22a9ff236788f04159b6b16b8d04bb	2024-10-29 09:07:46 +00:00
hiyouga	5f10a1e6fe	add e2e tests Former-commit-id: 0156a37450604641c4f5f9756ad84324698fc88c	2024-09-05 21:52:28 +08:00
hiyouga	04db03bdfd	add rlhf-v dataset Former-commit-id: 3fd18fc34a0c994a738504746abfd5548e002437	2024-09-01 22:57:41 +08:00
hiyouga	ec2da8b06a	remove visual_inputs, fix qlora Former-commit-id: be30c01c4f1482520ece770bd54c6a4837c26f0a	2024-08-31 00:24:51 +08:00

147 Commits