LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-03-16 17:15:59 +08:00

Author	SHA1	Message	Date
hiyouga	3722d04db1	release v0.9.2 Former-commit-id: e7ed1782d4a006400de6fc0f864abd01f7fadeea	2025-03-11 14:49:13 +08:00
hoshi-hiyouga	6c5927ba93	[infer] fix vllm args (#7235 ) Former-commit-id: 999be5b4512890b8cf4f45874a77e35cf35626f5	2025-03-11 01:15:35 +08:00
Ze-Yi LIN	1358ad9afd	[tracking] add swanlab_logdir param (#7219 ) * feat: add swanlab_logdir param * fix Former-commit-id: 9215ad488b6ac6cd57fe8fa4acdacceb63f68ca5	2025-03-11 00:53:07 +08:00
hoshi-hiyouga	c6331546a9	[config] update args (#7231 ) Former-commit-id: f71a901840811bf560df671ec63a146ff99140c6	2025-03-10 23:04:43 +08:00
hoshi-hiyouga	3c6f735cc3	[config] fix export max len (#7230 ) Former-commit-id: 211c0b3e8f3340acd2fae1762d9152a09f19ba34	2025-03-10 16:46:08 +08:00
hoshi-hiyouga	39ebcd222d	[data] update mm demo data (#7211 ) Former-commit-id: a6070050bbdc96a95d0f972e427a143bda1eb663	2025-03-07 20:07:15 +08:00
hoshi-hiyouga	d66cc2a161	[assets] update readme (#7209 ) Former-commit-id: d1631b38dad9ba3d41aebbb00e3500eb79b9e8e9	2025-03-07 17:27:49 +08:00
hoshi-hiyouga	8c7917d1a2	[data] fix loader (#7207 ) * fix dataloader * add test case * fix type * fix ci * fix ci * fix ci * disable overwrite cache in ci Former-commit-id: e84af0e140b1aafd1a6d6fe185a8e41c8fc5f831	2025-03-07 17:20:46 +08:00
hoshi-hiyouga	678c65fc28	[misc] fix ds config (#7205 ) Former-commit-id: b478fa1d9de1858075769f86f57126fde92db813	2025-03-07 15:21:28 +08:00
ZhangChuanhui	ee9bd3f2d7	[data] fix function formatter (#7201 ) Co-authored-by: zhangchuanhui <zhangchal@digitalchina.com> Former-commit-id: 3efb32b986170d2839e526640f85ba230715879a	2025-03-07 15:17:23 +08:00
hoshi-hiyouga	63e4b14565	[misc] fix cli (#7204 ) Former-commit-id: 999f57133ca163c7108d2d5ee8194eca9b2109b4	2025-03-07 15:01:18 +08:00
hoshi-hiyouga	1c924ebeec	[script] fix vllm version (#7193 ) Former-commit-id: ababdde597b2b9bf0ab3f30f036bc8d97de07f03	2025-03-06 17:14:17 +08:00
hoshi-hiyouga	25ee957d37	[webui] support escape html (#7190 ) Former-commit-id: cf9840374f171359c828b0d6f7a2aa9893c8f701	2025-03-06 16:52:21 +08:00
hoshi-hiyouga	00245a86e6	[deps] upgrade vllm (#7183 ) Former-commit-id: 37678a3d64668c3b4a4bfefc054e3b9b40427c1a	2025-03-06 15:25:08 +08:00
hoshi-hiyouga	26fddbd7c4	[data] fix mm template (#7181 ) Former-commit-id: 648616d473c81d393592806307e3e25b159cb278	2025-03-06 15:18:32 +08:00
hoshi-hiyouga	25546b9afe	[model] add QwQ 32b (#7179 ) Former-commit-id: 8897e48b8cd55407812453ddd4ff98ac7bdc4e91	2025-03-06 11:58:36 +08:00
Ze-Yi LIN	754dbb8b07	[trainer] fix swanlab callback (#7176 ) Former-commit-id: 6d9acf4bd30db24499118aee16bd19cb19ba9e3d	2025-03-06 00:33:37 +08:00
hoshi-hiyouga	54bcc37f55	[trainer] update config (#7174 ) Former-commit-id: 9f535d0e3c4ee3cd0f1b65218c2eee5d03f43c6f	2025-03-05 23:32:54 +08:00
sirui.li	58943fa554	[data] fix qwen2audio plugin (#7166 ) * Update pairwise.py [data]Repair multimodal model dpo training * Update pairwise.py [data]repair multimodal model dpo training using deepcopy * Update pairwise.py * Update mm_plugin.py Former-commit-id: 86763dfdb8e9e5668c1ddd7e924e4be76bf78368	2025-03-05 18:03:36 +08:00
hoshi-hiyouga	51821b91ae	[data] use bicubic resampler (#7143 ) Former-commit-id: c708f19ab0ab57526134952afddaa90aae8decbf	2025-03-04 00:17:06 +08:00
hoshi-hiyouga	fe34e58546	[webui] fix webui (#7142 ) Former-commit-id: d07281f8a45ad8a38d390181d01dcadbcf9aa1b9	2025-03-04 00:01:49 +08:00
rabbit	7c0684a2cf	[data] bailing template (#7117 ) * add bailing template * add bailing template * add bailing template --------- Co-authored-by: chengshiwen.csw@antgroup.com <chengshiwen.csw@antgroup.com> Former-commit-id: 4a36f5e0abb5a63f4b3b81560bb1ad0e6832d379	2025-03-03 15:33:22 +08:00
hoshi-hiyouga	7257ab2b23	[inference] fix hf_engine (#7120 ) Former-commit-id: f8cf5319cb5d6e06a1b0d8b8db2b678627f2271e	2025-03-01 05:22:49 +08:00
hoshi-hiyouga	75dde8856c	[assets] update wechat (#7106 ) Former-commit-id: 0ea430060994631e9fdb18fbbca0dd565a04fd66	2025-02-28 12:01:04 +08:00
Ze-Yi LIN	b531afb74e	[webui] display swanlab exp link (#7089 ) * webui add swanlab link * change callback name * update --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 27a4b93871c63b839c92940766bd7e0177972c9b	2025-02-27 19:40:54 +08:00
leo-pony	1590c3861c	[npu] update cann base image and torch 2.4 (#7061 ) * Update base npu container image version: the Python version required for Hugging Face Transformers is >= python3.10 * Fix the bug: arg type of INSTALL_DEEPSPEED should be string now. * Update Ascend CANN, CANN-Kernel and corresponding torch and torch-npu version * Upgrade torch-npu needs packages' version: torch==2.1.0 and torch-npu==2.4.0.post2 Former-commit-id: d6dafada58412b0c801e576ef4d8d96203f792af	2025-02-25 23:32:01 +08:00
hoshi-hiyouga	4d737be5a2	[misc] fix project toml (#7067 ) Former-commit-id: 28a668ff4e0beebfe5387362f5518c1d9343666f	2025-02-25 23:22:48 +08:00
JieShen	4f8c144f90	[script] add seed args (#7058 ) * add seed args * add seed args * update seed Former-commit-id: eb9770b2c01a840b6a0ac119210c22bdbb81e18b	2025-02-25 19:44:57 +08:00
Kingsley	43a106748c	[model] add paligemma2-mix series (#7060 ) Former-commit-id: 0c0196306d343242ee5e6f22c55562f9a74aa782	2025-02-25 18:51:16 +08:00
hoshi-hiyouga	76f860a228	[data] fix mllama (#7053 ) * fix mllama * fix test Former-commit-id: f5af20a63f3d59a6a68d323a7c6f68e551edb3a3	2025-02-24 22:05:38 +08:00
hoshi-hiyouga	a354df6d90	[model] add models (#7054 ) * add qwen25vl awq models * add moonlight Former-commit-id: ae3be2970fea8a35907202a313ab767381c44916	2025-02-24 22:05:13 +08:00
hoshi-hiyouga	6831083852	[assets] update readme (#7051 ) Former-commit-id: c89a39bfc6a3f0aaa376cd1b221320f466aba617	2025-02-24 20:45:06 +08:00
hoshi-hiyouga	22281363d0	[assets] update wechat (#7019 ) Former-commit-id: 3d102fe7e0bfc23db7d75f90ebaf53216c54cc85	2025-02-20 20:32:33 +08:00
Zhangchi Feng	354a0a28b3	[data] fix MiniCPMV plugin (#6998 ) * fix template * fix bug in messages processing Former-commit-id: f98b828f53968fb9c72bff9e45510ad5586c4fab	2025-02-19 19:36:04 +08:00
hoshi-hiyouga	040175ac8e	[webui] update css (#6985 ) Former-commit-id: 760a1dfb8193de418d7aa1063c0d111a3a64ae0f	2025-02-18 18:27:57 +08:00
hoshi-hiyouga	b50ca5cafa	[data] add r1 distill dataset (#6983 ) Former-commit-id: 1da5ee4edaa3896593b9cae488f0ac5917c3243e	2025-02-18 17:25:09 +08:00
hoshi-hiyouga	865b2b8b87	[version] support transformers 449 (#6982 ) * support transformers 449 * fix mm plugin Former-commit-id: e9118a9df0839d24f6ddff5a0b55ef101a1d3d22	2025-02-18 17:05:40 +08:00
hoshi-hiyouga	ccc656376f	[misc] fix script (#6977 ) Former-commit-id: 775efa1d8cbdb1b7d122be2a986d47f85214e0a1	2025-02-18 17:00:46 +08:00
hoshi-hiyouga	5deefc6094	[data] update vlm args (#6976 ) Former-commit-id: c28e710636a0286d4b8a1d494529b25168a8f3ab	2025-02-18 02:12:51 +08:00
hoshi-hiyouga	b7ccfd28d1	[data] add min resolution option (#6975 ) Former-commit-id: 76bd9a98a2fb00f1a1d881e6e1364c02fd36d327	2025-02-18 01:40:46 +08:00
hoshi-hiyouga	4e661b63e3	[data] fix predict dataset (#6972 ) Former-commit-id: f9a82e527877b1ed47cabb3d34f4d155705f4048	2025-02-17 20:29:40 +08:00
Zhangchi Feng	69fcc8e0c0	[data] fix minicpmo template (#6946 ) Former-commit-id: 09e4438b58d5c1a5fdde37ff781c3d79461c4743	2025-02-15 00:37:41 +08:00
Eric Tang	413aa5944a	[ray] specify ray storage path (#6920 ) Former-commit-id: 4be6b66b1eaa79955e936ce2b747a8837ecd1e49	2025-02-14 21:55:41 +08:00
hoshi-hiyouga	1cda37892e	[misc] fix lora regex (#6944 ) * fix lora regex * fix Former-commit-id: 1d0ecbaee1b72f1e03154ddd4fcc8b7876e01f89	2025-02-14 21:38:43 +08:00
hoshi-hiyouga	6ebe81e04d	[misc] fix grad ckpt (#6931 ) Former-commit-id: deae1fc9a0bea5c8b8be1564cf9c81c9c02a0b3a	2025-02-13 23:27:51 +08:00
hoshi-hiyouga	a9b4e229af	[model] add liger kernel to qwen2_5 vl (#6930 ) * add liger kernel to qwen2_5 vl * fix patch * fix patch Former-commit-id: 828776d155986166498dfc907194f64436571106	2025-02-13 23:05:54 +08:00
Billy Cao	680648098b	[trainer] fix gen_kwarg to eval during training (#5451 ) * Correctly pass gen_kwarg to eval during model runs * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 845d16122496311e08263610a6a922f82604de7b	2025-02-13 02:35:06 +08:00
SrWYG	d9ea4baf00	[data] evaluate on each dataset (#5522 ) * [Update] loader.py: evaluate will run separate evaluations on each dataset. `If you pass a dictionary with names of datasets as keys and datasets as values, evaluate will run separate evaluations on each dataset. This can be useful to monitor how training affects other datasets or simply to get a more fine-grained evaluation.` The seq2seq trainer supports eval_dataset as a Dict. * fix format * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: cf00f78650a442c85678ce805e030d2b96cbecd7	2025-02-13 02:19:03 +08:00
Noah	f1c2ae9d47	[data] improve error handling (#6128 ) * sync from upstream * update * update * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 1569e6096fec07da5583f1a3435b0d23ae09b5ba	2025-02-13 01:39:41 +08:00
hoshi-hiyouga	a7448ef8f3	[misc] update readme (#6918 ) Former-commit-id: f5823479bd51c39db668b68056be749af09894d1	2025-02-13 01:01:41 +08:00

2653 Commits