LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-03-08 12:46:06 +08:00

Author	SHA1	Message	Date
hoshi-hiyouga	2baf8bf03d	[misc] fix lora regex (#6944 ) * fix lora regex * fix Former-commit-id: `1ada3ae5a3`	2025-02-14 21:38:43 +08:00
SrWYG	0ad9f7f058	[data] evaluate on each dataset (#5522 ) * [Update] loader.py , evaluate will run separate evaluations on each dataset. `If you pass a dictionary with names of datasets as keys and datasets as values, evaluate will run separate evaluations on each dataset. This can be useful to monitor how training affects other datasets or simply to get a more fine-grained evaluation` seq2seqtrainner support eval_dataset as Dict. * fix format * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `1e35967ae1`	2025-02-13 02:19:03 +08:00
Noah	1adb46875f	[data] improve error handling (#6128 ) * sync from upstream * update * update * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `4c7bfebcf1`	2025-02-13 01:39:41 +08:00
hoshi-hiyouga	1679930e00	[breaking change] refactor data pipeline (#6901 ) * refactor data * rename file Former-commit-id: `617c8ab467`	2025-02-13 00:39:20 +08:00
marko1616	bae934dea3	[trainer] fix llama3.2 vision kto train (#6904 ) Former-commit-id: `b7fd1e9c00`	2025-02-12 19:09:14 +08:00
hoshi-hiyouga	2e2f6bea07	[data] feat: auto template (#6905 ) * support auto template * add unittest Former-commit-id: `2f8b6847f5`	2025-02-12 00:22:53 +08:00
hoshi-hiyouga	197aa3baf4	[data] fix ollama template (#6902 ) * fix ollama template * add meta info * use half precision Former-commit-id: `e1a7c1242c`	2025-02-11 22:43:09 +08:00
hoshi-hiyouga	c6be9e242c	[misc] support export ollama modelfile (#6899 ) * support export ollama modelfile * update config * add system and num ctx Former-commit-id: `9184a6e0ed`	2025-02-11 19:52:25 +08:00
hoshi-hiyouga	2e954d8fd2	[data] refactor template (#6896 ) Former-commit-id: `d1b8aa3835`	2025-02-11 17:59:25 +08:00
hoshi-hiyouga	593acca556	[data] refactor mm plugin (#6895 ) * refactor plugin * lint Former-commit-id: `aca63bfcca`	2025-02-11 16:34:49 +08:00
HJ	188f22d8a7	[data] fix qwen_2_5_vl video processing (#6868 ) * fix qwen_2_5_vl video processing * Update mm_plugin.py * Update mm_plugin.py --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `9153a7bd83`	2025-02-11 16:14:50 +08:00
Zhangchi Feng	5433b318bb	[da'ta] fix minicpmv plugin (#6890 ) * fix template name * tiny fix * support minicpm-o-2.6 * support inference of minicpmv * update readme * support dpo of minicpmv * update init audio * update init audio * [model]fix image process in minicpmo * fix no mm inputs Former-commit-id: `764627645a`	2025-02-11 13:30:44 +08:00
HJ	fe4f4e9758	[data] fix: sharegpt converter (#6879 ) * fix-sharegpt-format * fix --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `0fb44cb3a5`	2025-02-10 21:59:12 +08:00
hoshi-hiyouga	1bb3d17d9e	[data] fix mllama collator (#6874 ) Former-commit-id: `b68199db27`	2025-02-09 22:42:25 +08:00
hoshi-hiyouga	b93333685b	[test] align test cases (#6865 ) * align test cases * fix function formatter Former-commit-id: `f6f3f8d0fc`	2025-02-09 01:03:49 +08:00
hoshi-hiyouga	fcd0f0480d	[dataset] add openthought (#6866 ) Former-commit-id: `1356f9d840`	2025-02-09 00:53:01 +08:00
hoshi-hiyouga	28037c7834	fix qwen2vl plugin (#6855 ) Former-commit-id: `40048ab77a`	2025-02-08 10:59:10 +08:00
Zhangchi Feng	01915eaf40	[model] support audio (#6701 ) * support qwen2_audio * improve code * lint * fix * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `24c7842948`	2025-02-05 04:59:09 +08:00
Yueqi Song	e665e1fed5	[data] allow thought in function call (#6797 ) * Update template.py * Update template.py * use formatter * fix regex --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `a5e943f7bc`	2025-02-05 02:26:23 +08:00
hoshi-hiyouga	1fee69f874	[misc] update license year & fix llama pro (#6814 ) * fix llamapro script * change year Former-commit-id: `e2dc5b952a`	2025-02-05 01:53:33 +08:00
Yueqi Song	8504bde893	[data] fix qwen tool template (#6796 ) * Update tool_utils.py * fix unittest --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `dd6b7d203e`	2025-02-05 00:02:00 +08:00
Zhangchi Feng	85f22d01bf	[data] fix minicpmv plugin (#6801 ) * fix template name * tiny fix * support minicpm-o-2.6 * support inference of minicpmv * update readme * support dpo of minicpmv * update init audio * update init audio * [model]fix image process in minicpmo Former-commit-id: `ab9bd068ef`	2025-02-04 21:20:15 +08:00
hoshi-hiyouga	445d643ef3	[model] add mistral small models (#6786 ) Former-commit-id: `94803d8133`	2025-02-01 04:31:38 +08:00
hoshi-hiyouga	e8c1979b79	[model] add qwen2.5 vl models (#6779 ) Former-commit-id: `999c7c8fe0`	2025-01-31 03:00:29 +08:00
hoshi-hiyouga	245de012ca	[webui] improve webui & reasoning mode (#6778 ) Former-commit-id: `45e68b9f09`	2025-01-31 00:09:21 +08:00
hoshi-hiyouga	1efe525df7	[model] support yarn (#6693 ) Former-commit-id: `1f47b6186c`	2025-01-18 13:56:09 +08:00
hoshi-hiyouga	f87c788154	[misc] update mm plugin (#6691 ) Former-commit-id: `c0caa7afc6`	2025-01-17 23:04:26 +08:00
Zhangchi Feng	555f17c1ee	[data] Fix minicpmv/o dpo training (#6657 ) * fix template name * tiny fix * support minicpm-o-2.6 * support inference of minicpmv * update readme * support dpo of minicpmv Former-commit-id: `027942789b`	2025-01-15 17:30:37 +08:00
hoshi-hiyouga	91433d639c	lint (#6641 ) Former-commit-id: `1278c3e92e`	2025-01-14 18:40:07 +08:00
Haian Huang(深度眸)	864ee06243	Support InternLM3 Dense 8B Model (#6640 ) * support internlm3 * update * update * update * add hint Former-commit-id: `deacc00b12`	2025-01-14 18:07:27 +08:00
hoshi-hiyouga	8f73c75c16	[model] fix mllama any image (#6637 ) * fix mllama any image * reorder classes Former-commit-id: `98189c8e4d`	2025-01-14 16:47:58 +08:00
Zhangchi Feng	201a495154	Support new features of MiniCPM-V (#6626 ) * fix template name * tiny fix * support minicpm-o-2.6 Former-commit-id: `c3fda5046d`	2025-01-14 00:26:19 +08:00
hoshi-hiyouga	d8cba9464f	[inference] fix stop token for object detection (#6624 ) * fix stop token * update minicpm data pipeline * fix npu qlora examples Former-commit-id: `e3e2c8c689`	2025-01-13 21:34:20 +08:00
Zhangchi Feng	15bba15725	Fix template name of MiniCPM-V (#6620 ) * fix template name * tiny fix Former-commit-id: `3077f20339`	2025-01-13 16:46:48 +08:00
fzc8578	313ce9a576	remove tests Former-commit-id: `a019cece80`	2025-01-13 15:08:35 +08:00
fzc8578	4741eec2d1	fix style Former-commit-id: `0cc7260a93`	2025-01-13 14:19:38 +08:00
fzc8578	d2afe0c63c	fix system prompt and tests Former-commit-id: `cfaa8e4890`	2025-01-13 14:18:06 +08:00
fzc8578	bdded9d41a	add some Former-commit-id: `01e9cfd406`	2025-01-11 15:03:20 +08:00
fzc8578	e7f928adc4	fix format Former-commit-id: `7b44f3127e`	2025-01-11 01:27:40 +08:00
fzc8578	62c12a133e	add some Former-commit-id: `a650e114e9`	2025-01-11 01:10:24 +08:00
fzc8578	08e8499a98	adapt to new mllm_param Former-commit-id: `291384dea8`	2025-01-11 00:16:34 +08:00
Zhangchi Feng	d5b18ee4a6	Merge branch 'main' into minicpmv Former-commit-id: `ed0895a9c1`	2025-01-11 00:01:36 +08:00
hiyouga	c89d17ab63	refactor mllm param logic Former-commit-id: `f6f630a1c9`	2025-01-10 15:45:48 +00:00
fzc8578	0fb50f9c88	add some Former-commit-id: `771cc80294`	2025-01-10 23:29:06 +08:00
fzc8578	bcbe37ff52	add some Former-commit-id: `ae1f528df3`	2025-01-10 21:25:32 +08:00
fzc8578	994049380d	fix some Former-commit-id: `15bbcdf8d3`	2025-01-10 20:55:52 +08:00
fzc8578	7138b43873	fix some Former-commit-id: `2ee8ba2f39`	2025-01-10 20:27:06 +08:00
Zhangchi Feng	f51ac40f0a	Merge branch 'main' into minicpmv Former-commit-id: `fc045d7dd8`	2025-01-10 20:12:07 +08:00
fzc8578	165fe8e219	add some Former-commit-id: `096a6cb67a`	2025-01-10 20:01:22 +08:00
hiyouga	b471def13d	improve template, add phi4 model Former-commit-id: `ae16ea755d`	2025-01-09 18:27:54 +00:00

1 2 3 4 5 ...

352 Commits