LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-06-23 15:48:57 +08:00

Author	SHA1	Message	Date
hoshi-hiyouga	d2f845d70d	[deps] upgrade vllm (#7183 ) Former-commit-id: 37678a3d64668c3b4a4bfefc054e3b9b40427c1a	2025-03-06 15:25:08 +08:00
hoshi-hiyouga	bb8aba5abf	[data] fix mm template (#7181 ) Former-commit-id: 648616d473c81d393592806307e3e25b159cb278	2025-03-06 15:18:32 +08:00
hoshi-hiyouga	9f16c50155	[model] add QwQ 32b (#7179 ) Former-commit-id: 8897e48b8cd55407812453ddd4ff98ac7bdc4e91	2025-03-06 11:58:36 +08:00
Ze-Yi LIN	25bb9f5ad9	[trainer] fix swanlab callback (#7176 ) Former-commit-id: 6d9acf4bd30db24499118aee16bd19cb19ba9e3d	2025-03-06 00:33:37 +08:00
hoshi-hiyouga	7b985f55db	[trainer] update config (#7174 ) Former-commit-id: 9f535d0e3c4ee3cd0f1b65218c2eee5d03f43c6f	2025-03-05 23:32:54 +08:00
sirui.li	fd0357a26d	[data] fix qwen2audio plugin (#7166 ) * Update pairwise.py [data]Repair multimodal model dpo training * Update pairwise.py [data]repair multimodal model dpo training using deepcopy * Update pairwise.py * Update mm_plugin.py Former-commit-id: 86763dfdb8e9e5668c1ddd7e924e4be76bf78368	2025-03-05 18:03:36 +08:00
hoshi-hiyouga	31f9daa362	[data] use bicubic resampler (#7143 ) Former-commit-id: c708f19ab0ab57526134952afddaa90aae8decbf	2025-03-04 00:17:06 +08:00
hoshi-hiyouga	15ea576246	[webui] fix webui (#7142 ) Former-commit-id: d07281f8a45ad8a38d390181d01dcadbcf9aa1b9	2025-03-04 00:01:49 +08:00
rabbit	19a6916d80	[data] bailing template (#7117 ) * add bailing template * add bailing template * add bailing template --------- Co-authored-by: chengshiwen.csw@antgroup.com <chengshiwen.csw@antgroup.com> Former-commit-id: 4a36f5e0abb5a63f4b3b81560bb1ad0e6832d379	2025-03-03 15:33:22 +08:00
hoshi-hiyouga	585c475f71	[inference] fix hf_engine (#7120 ) Former-commit-id: f8cf5319cb5d6e06a1b0d8b8db2b678627f2271e	2025-03-01 05:22:49 +08:00
Ze-Yi LIN	11672f760d	[webui] display swanlab exp link (#7089 ) * webui add swanlab link * change callback name * update --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 27a4b93871c63b839c92940766bd7e0177972c9b	2025-02-27 19:40:54 +08:00
hoshi-hiyouga	5f65558088	[misc] fix project toml (#7067 ) Former-commit-id: 28a668ff4e0beebfe5387362f5518c1d9343666f	2025-02-25 23:22:48 +08:00
Kingsley	2986bef530	[model] add paligemma2-mix series (#7060 ) Former-commit-id: 0c0196306d343242ee5e6f22c55562f9a74aa782	2025-02-25 18:51:16 +08:00
hoshi-hiyouga	065f7fb5da	[data] fix mllama (#7053 ) * fix mllama * fix test Former-commit-id: f5af20a63f3d59a6a68d323a7c6f68e551edb3a3	2025-02-24 22:05:38 +08:00
hoshi-hiyouga	c1d5073bd3	[model] add models (#7054 ) * add qwen25vl awq models * add moonlight Former-commit-id: ae3be2970fea8a35907202a313ab767381c44916	2025-02-24 22:05:13 +08:00
Zhangchi Feng	fcf75633a0	[data] fix MiniCPMV plugin (#6998 ) * fix template * fix bug in messages processing Former-commit-id: f98b828f53968fb9c72bff9e45510ad5586c4fab	2025-02-19 19:36:04 +08:00
hoshi-hiyouga	e77ced045d	[webui] update css (#6985 ) Former-commit-id: 760a1dfb8193de418d7aa1063c0d111a3a64ae0f	2025-02-18 18:27:57 +08:00
hoshi-hiyouga	1d675a287d	[version] support transformers 449 (#6982 ) * support transformers 449 * fix mm plugin Former-commit-id: e9118a9df0839d24f6ddff5a0b55ef101a1d3d22	2025-02-18 17:05:40 +08:00
hoshi-hiyouga	be33ef67fb	[misc] fix script (#6977 ) Former-commit-id: 775efa1d8cbdb1b7d122be2a986d47f85214e0a1	2025-02-18 17:00:46 +08:00
hoshi-hiyouga	f5cd17881e	[data] update vlm args (#6976 ) Former-commit-id: c28e710636a0286d4b8a1d494529b25168a8f3ab	2025-02-18 02:12:51 +08:00
hoshi-hiyouga	c09b648934	[data] add min resolution option (#6975 ) Former-commit-id: 76bd9a98a2fb00f1a1d881e6e1364c02fd36d327	2025-02-18 01:40:46 +08:00
hoshi-hiyouga	f2fd9d1b25	[data] fix predict dataset (#6972 ) Former-commit-id: f9a82e527877b1ed47cabb3d34f4d155705f4048	2025-02-17 20:29:40 +08:00
Zhangchi Feng	167342af8a	[data] fix minicpmo template (#6946 ) Former-commit-id: 09e4438b58d5c1a5fdde37ff781c3d79461c4743	2025-02-15 00:37:41 +08:00
Eric Tang	76f9bd1820	[ray] specify ray storage path (#6920 ) Former-commit-id: 4be6b66b1eaa79955e936ce2b747a8837ecd1e49	2025-02-14 21:55:41 +08:00
hoshi-hiyouga	a893505924	[misc] fix lora regex (#6944 ) * fix lora regex * fix Former-commit-id: 1d0ecbaee1b72f1e03154ddd4fcc8b7876e01f89	2025-02-14 21:38:43 +08:00
hoshi-hiyouga	ed25e051a9	[misc] fix grad ckpt (#6931 ) Former-commit-id: deae1fc9a0bea5c8b8be1564cf9c81c9c02a0b3a	2025-02-13 23:27:51 +08:00
hoshi-hiyouga	5e5fc337f9	[model] add liger kernel to qwen2_5 vl (#6930 ) * add liger kernel to qwen2_5 vl * fix patch * fix patch Former-commit-id: 828776d155986166498dfc907194f64436571106	2025-02-13 23:05:54 +08:00
Billy Cao	58e9ca8aa0	[trainer] fix gen_kwarg to eval during training (#5451 ) * Correctly pass gen_kwarg to eval during model runs * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 845d16122496311e08263610a6a922f82604de7b	2025-02-13 02:35:06 +08:00
SrWYG	a4c4b8496f	[data] evaluate on each dataset (#5522 ) * [Update] loader.py , evaluate will run separate evaluations on each dataset. `If you pass a dictionary with names of datasets as keys and datasets as values, evaluate will run separate evaluations on each dataset. This can be useful to monitor how training affects other datasets or simply to get a more fine-grained evaluation` seq2seqtrainner support eval_dataset as Dict. * fix format * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: cf00f78650a442c85678ce805e030d2b96cbecd7	2025-02-13 02:19:03 +08:00
Noah	38c9641777	[data] improve error handling (#6128 ) * sync from upstream * update * update * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 1569e6096fec07da5583f1a3435b0d23ae09b5ba	2025-02-13 01:39:41 +08:00
hoshi-hiyouga	46203856fc	[breaking change] refactor data pipeline (#6901 ) * refactor data * rename file Former-commit-id: 7a1a4ce6451cb782573d0bd9dd27a5e443e3a18b	2025-02-13 00:39:20 +08:00
hoshi-hiyouga	3a3f4072e5	[misc] fix grad ckpt func (#6916 ) Former-commit-id: 35e069a52b3d7cfd9b0107574b09265eb2290f0b	2025-02-13 00:17:18 +08:00
marko1616	0c0cdc26bc	[trainer] fix llama3.2 vision kto train (#6904 ) Former-commit-id: 1563e89adc8988fc6e4250634a3f1e385979b0e5	2025-02-12 19:09:14 +08:00
hoshi-hiyouga	2581cc844b	[data] feat: auto template (#6905 ) * support auto template * add unittest Former-commit-id: 0c6c9150db6414a5a05527ea486dce6633dff4b3	2025-02-12 00:22:53 +08:00
hoshi-hiyouga	86063e27ea	[data] fix ollama template (#6902 ) * fix ollama template * add meta info * use half precision Former-commit-id: 1304bbea69d8c8ca57140017515dee7ae2ee6536	2025-02-11 22:43:09 +08:00
hoshi-hiyouga	88eafd865b	[misc] support export ollama modelfile (#6899 ) * support export ollama modelfile * update config * add system and num ctx Former-commit-id: 8c2af7466f4015f300b51841db11bcd2505ebf20	2025-02-11 19:52:25 +08:00
hoshi-hiyouga	3f7bd98bfa	[data] refactor template (#6896 ) Former-commit-id: f78d5a3eca947ed965ca2f6c87d60441b1a59867	2025-02-11 17:59:25 +08:00
hoshi-hiyouga	808ff89a2d	[data] refactor mm plugin (#6895 ) * refactor plugin * lint Former-commit-id: 1c8dcc3adca4a2e78f514f8bb70573dd1ca08746	2025-02-11 16:34:49 +08:00
HJ	6d7f1299bd	[data] fix qwen_2_5_vl video processing (#6868 ) * fix qwen_2_5_vl video processing * Update mm_plugin.py * Update mm_plugin.py --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 35f326dabdc8e84036296d2e3de1c84c67b8def8	2025-02-11 16:14:50 +08:00
Zhangchi Feng	2047eab723	[da'ta] fix minicpmv plugin (#6890 ) * fix template name * tiny fix * support minicpm-o-2.6 * support inference of minicpmv * update readme * support dpo of minicpmv * update init audio * update init audio * [model]fix image process in minicpmo * fix no mm inputs Former-commit-id: cdd19ccd8cec460606b4545e886e932c1c5c5fe1	2025-02-11 13:30:44 +08:00
HJ	e11b40c344	[data] fix: sharegpt converter (#6879 ) * fix-sharegpt-format * fix --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: ae8f8151ff750839998b50446f127061f240d41a	2025-02-10 21:59:12 +08:00
hoshi-hiyouga	b869506a57	[data] fix mllama collator (#6874 ) Former-commit-id: c694fa3d66651c6ce547fa72c8260c46a406126b	2025-02-09 22:42:25 +08:00
hoshi-hiyouga	72d5b06b08	[test] align test cases (#6865 ) * align test cases * fix function formatter Former-commit-id: a68f5e22d0391c80a9a826dc83967255be572032	2025-02-09 01:03:49 +08:00
hoshi-hiyouga	94726bdc8d	[dataset] add openthought (#6866 ) Former-commit-id: 20c748a4f108c0087f0d85377a4aa99126a0beb0	2025-02-09 00:53:01 +08:00
hoshi-hiyouga	4d1791e905	[deps] upgrade vllm (#6857 ) Former-commit-id: 4bd50f65a3d62528768561019fda2723d045c7fd	2025-02-08 15:02:28 +08:00
hoshi-hiyouga	528e06ccaa	fix qwen2vl plugin (#6855 ) Former-commit-id: fd13b7138ab3f4da0a429a327b9d076bcb70b944	2025-02-08 10:59:10 +08:00
hoshi-hiyouga	fec641ec82	[misc] allow extra args (#6831 ) Former-commit-id: 0fd3a5295cb4e08a4e57e860e82103364c28fba8	2025-02-06 12:38:08 +08:00
Zhangchi Feng	8f401e37f8	[model] support audio (#6701 ) * support qwen2_audio * improve code * lint * fix * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 5eacb5629e4d7733cd992a63747a1335f2c6a929	2025-02-05 04:59:09 +08:00
Yueqi Song	9feb78e7b4	[data] allow thought in function call (#6797 ) * Update template.py * Update template.py * use formatter * fix regex --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 3a31af6e920683ec074da93b1719e29f5d4cffd6	2025-02-05 02:26:23 +08:00
hoshi-hiyouga	c2022431aa	[misc] update license year & fix llama pro (#6814 ) * fix llamapro script * change year Former-commit-id: d9ae594178796994d400a5f207d6499712816f89	2025-02-05 01:53:33 +08:00

... 2 3 4 5 6 ...

1933 Commits