LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-02-06 22:12:19 +08:00

Author	SHA1	Message	Date
Yaowei Zheng	4dfad24902	[model] add gpt oss (#8826 )	2025-08-06 05:56:46 +08:00
Butui Hu	1a33d65a56	[launcher] Add elastic and fault-tolerant training support (#8286 ) Signed-off-by: Butui Hu <hot123tea123@gmail.com>	2025-06-05 16:40:03 +08:00
hoshi-hiyouga	aa9ed4db59	[example] update examples (#7964 )	2025-05-06 17:24:25 +02:00
hoshi-hiyouga	73198a6645	[misc] fix uv (#7913 )	2025-04-30 07:45:03 +08:00
hoshi-hiyouga	b07628dea5	[example] add bash usage (#7794 )	2025-04-22 00:25:51 +08:00
Juanxi Tian	12ada72ed4	[trainer] Add Muon Optimizer (#7749 ) Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>	2025-04-21 23:38:37 +08:00
hoshi-hiyouga	416853dd25	[parser] support omegaconf (#7793 )	2025-04-21 23:30:30 +08:00
hoshi-hiyouga	d222f63cb7	[infer] set env for vllm ascend (#7745 )	2025-04-17 01:08:55 +08:00
leo-pony	b9263ff5ac	[infer] support vllm-ascend (#7739 )	2025-04-16 20:06:47 +08:00
Eric Tang	bb8d79bae2	[ray] allow for specifying ray.init kwargs (i.e. runtime_env) (#7647 ) * ray init kwargs * Update trainer_utils.py * fix ray args --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>	2025-04-10 11:31:05 +08:00
hoshi-hiyouga	c3c0efbaa0	[misc] fix packing and eval plot (#7623 )	2025-04-07 18:20:57 +08:00
hoshi-hiyouga	5115dc8c7f	[assets] update readme (#7612 )	2025-04-06 13:58:49 +08:00
hoshi-hiyouga	2bfcad2394	[model] fix kv cache (#7564 )	2025-04-01 23:07:46 +08:00
Qiaolin Yu	a44a53ebec	[inference] support sglang backend (#7278 ) * Mimic SGLang offline Engine * Add more tests and args * Pass all current tests * Clean Code * fix sample_params * clean code * Fix Stream Chat * change sglang from engine mode to server mode * fix * Fix Review Issues * Use SGLang Built-In Utilities * Fix test SGLang * Some Doc Issue * fix sglang engine * add readme --------- Co-authored-by: Jin Pan <jpan236@wisc.edu> Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>	2025-03-15 04:37:58 +08:00
hoshi-hiyouga	71a1c1321a	[config] update args (#7231 ) Former-commit-id: f71a901840811bf560df671ec63a146ff99140c6	2025-03-10 23:04:43 +08:00
hoshi-hiyouga	82a2bac866	[misc] fix ds config (#7205 ) Former-commit-id: b478fa1d9de1858075769f86f57126fde92db813	2025-03-07 15:21:28 +08:00
hoshi-hiyouga	7b985f55db	[trainer] update config (#7174 ) Former-commit-id: 9f535d0e3c4ee3cd0f1b65218c2eee5d03f43c6f	2025-03-05 23:32:54 +08:00
hoshi-hiyouga	c1d5073bd3	[model] add models (#7054 ) * add qwen25vl awq models * add moonlight Former-commit-id: ae3be2970fea8a35907202a313ab767381c44916	2025-02-24 22:05:13 +08:00
hoshi-hiyouga	f5cd17881e	[data] update vlm args (#6976 ) Former-commit-id: c28e710636a0286d4b8a1d494529b25168a8f3ab	2025-02-18 02:12:51 +08:00
hoshi-hiyouga	c09b648934	[data] add min resolution option (#6975 ) Former-commit-id: 76bd9a98a2fb00f1a1d881e6e1364c02fd36d327	2025-02-18 01:40:46 +08:00
hoshi-hiyouga	290057069e	[misc] update readme (#6917 ) Former-commit-id: 6bbed1d8c4189fb7bea40230e278c40bb5336fbd	2025-02-13 00:58:10 +08:00
Eric Tang	5a221d91f9	[example] fix path to ray example (#6906 ) Former-commit-id: e9bee3ef045d85051da04e6ad581a23a9e1a9551	2025-02-13 00:29:32 +08:00
hoshi-hiyouga	86063e27ea	[data] fix ollama template (#6902 ) * fix ollama template * add meta info * use half precision Former-commit-id: 1304bbea69d8c8ca57140017515dee7ae2ee6536	2025-02-11 22:43:09 +08:00
hoshi-hiyouga	88eafd865b	[misc] support export ollama modelfile (#6899 ) * support export ollama modelfile * update config * add system and num ctx Former-commit-id: 8c2af7466f4015f300b51841db11bcd2505ebf20	2025-02-11 19:52:25 +08:00
hoshi-hiyouga	332f637592	disable valset by default (#6690 ) Former-commit-id: a1a94f364e33d1d73852f74eda4fa581e6b16533	2025-01-17 21:09:30 +08:00
hoshi-hiyouga	7638f1070e	[optim] clean apollo (#6645 ) * clean apollo code * update readme Former-commit-id: 38b8ec4a99189483124b54df9d6bc6b0d318855a	2025-01-15 01:42:50 +08:00
zhuHQ	c2120432db	[optim] add support to APOLLO (#6617 ) Former-commit-id: 5a252e5a458457adbd19da3b68a3897ad2962824	2025-01-15 00:24:56 +08:00
hoshi-hiyouga	2a05941b14	[inference] fix stop token for object detection (#6624 ) * fix stop token * update minicpm data pipeline * fix npu qlora examples Former-commit-id: 844919fadaa8a61dfae47020971ea80730b2346f	2025-01-13 21:34:20 +08:00
codingma	11c38b9173	add nf4 qlora support on Ascend NPU (#6601 ) * add nf4 qlora support on Ascend NPU * add transformers version check * add python>=3.10 requirement description for npu * tiny fix --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 7912d1acac5f10dab22145fe729a90c57aad8d85	2025-01-13 19:43:36 +08:00
hiyouga	dc65ecdf09	refactor mllm param logic Former-commit-id: b895c190945cf5d991cb4e4dea2ae73cc9c8d246	2025-01-10 15:45:48 +00:00
hiyouga	944a2aec4d	refactor ray integration, support save ckpt Former-commit-id: 2f50b27e608b2092bfceab6c6e84e6631e973ee2	2025-01-07 09:39:10 +00:00
Eric Tang	4f31ad997c	run style check Former-commit-id: 5ec33baf5f95df9fa2afe5523c825d3eda8a076b	2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi	8683582300	drafting ray integration Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com> Former-commit-id: 19c12ddae9350f6e25a270fe3372f5b9094cf960	2025-01-07 08:55:44 +00:00
Yaser Afshar	6f1c8dacea	Add missing key to init_kwargs Former-commit-id: 03fc4621dad132164596a58d3e8693787b7e1aca	2024-12-17 12:34:05 +00:00
Yaser Afshar	8881237475	Add trust_remote_code parameter and remove True - Introduced a new model parameter `trust_remote_code` - Set the default value of `trust_remote_code` to `False` to enhance security Former-commit-id: 4bf23f406cf5235c16f9f8139850c53354901814	2024-12-17 12:25:12 +00:00
hiyouga	8c65548b10	update assets Former-commit-id: 7b9bd552b2bf97b72976511094eb51dfde5d1017	2024-12-14 17:36:03 +00:00
hiyouga	fb22651faf	fix mrope Former-commit-id: 55bee1d333549ca19858b3f5c1b7b86926e5fb09	2024-12-12 15:08:17 +00:00
hiyouga	bac2c64f87	support qwen2vl train proj only Former-commit-id: 0e949ef03455726e907c6f1039e93ebe480c897a	2024-12-05 10:37:42 +00:00
hiyouga	39865d8a1f	update examples Former-commit-id: bcb010be7732ae137f156932100ee4d02a93725c	2024-12-05 08:48:25 +00:00
hiyouga	c1768cfb14	support batch infer in vllm Former-commit-id: 3ef5ed3b9a44eed2f7e3ff221dfc343d0a97c0b5	2024-12-04 13:50:00 +00:00
hiyouga	1e6f96508a	add vllm config Former-commit-id: 95365f0ce4f362bde7de8b679b54b548d7055bfb	2024-11-10 21:28:18 +08:00
hiyouga	ba66ac084f	update tests Former-commit-id: 4e92b656e324725048d914946e70867be20032ff	2024-11-02 12:41:44 +08:00
hiyouga	9bdba2f6a8	add e2e tests Former-commit-id: 0156a37450604641c4f5f9756ad84324698fc88c	2024-09-05 21:52:28 +08:00
hiyouga	60cf12727b	add rlhf-v dataset Former-commit-id: 3fd18fc34a0c994a738504746abfd5548e002437	2024-09-01 22:57:41 +08:00
hiyouga	2f6fc27c8b	remove visual_inputs, fix qlora Former-commit-id: be30c01c4f1482520ece770bd54c6a4837c26f0a	2024-08-31 00:24:51 +08:00
hiyouga	66a1abac6a	add examples Former-commit-id: 169c68921b1b8ac279834b060d9e7d38a56fe1aa	2024-08-30 21:43:19 +08:00
hiyouga	c62a6ca59d	refactor mm training Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a	2024-08-30 02:14:31 +08:00
simonJJJ	0f3d54d8a0	initial-commit Former-commit-id: b6a39847a10b417b09db4b5512dd835e9e4ce928	2024-08-28 16:51:35 +08:00
hiyouga	47efcdb1dd	update examples Former-commit-id: d5c57c8b7f64afe8061045ec9689abbac45c1175	2024-08-09 20:13:46 +08:00
hiyouga	59cbce1a46	add adam_mini to readme Former-commit-id: d610c6bcf8a8ba6f4236f5d11f79571b83f4fb11	2024-08-09 20:02:03 +08:00

1 2 3

142 Commits