LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-02-09 07:22:19 +08:00

Author	SHA1	Message	Date
SrWYG	a4c4b8496f	[data] evaluate on each dataset (#5522 ) * [Update] loader.py, evaluate will run separate evaluations on each dataset. `If you pass a dictionary with names of datasets as keys and datasets as values, evaluate will run separate evaluations on each dataset. This can be useful to monitor how training affects other datasets or simply to get a more fine-grained evaluation` seq2seqtrainer supports eval_dataset as Dict. * fix format * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: cf00f78650a442c85678ce805e030d2b96cbecd7	2025-02-13 02:19:03 +08:00
Noah	38c9641777	[data] improve error handling (#6128 ) * sync from upstream * update * update * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 1569e6096fec07da5583f1a3435b0d23ae09b5ba	2025-02-13 01:39:41 +08:00
hoshi-hiyouga	46203856fc	[breaking change] refactor data pipeline (#6901 ) * refactor data * rename file Former-commit-id: 7a1a4ce6451cb782573d0bd9dd27a5e443e3a18b	2025-02-13 00:39:20 +08:00
marko1616	0c0cdc26bc	[trainer] fix llama3.2 vision kto train (#6904 ) Former-commit-id: 1563e89adc8988fc6e4250634a3f1e385979b0e5	2025-02-12 19:09:14 +08:00
hoshi-hiyouga	2581cc844b	[data] feat: auto template (#6905 ) * support auto template * add unittest Former-commit-id: 0c6c9150db6414a5a05527ea486dce6633dff4b3	2025-02-12 00:22:53 +08:00
hoshi-hiyouga	86063e27ea	[data] fix ollama template (#6902 ) * fix ollama template * add meta info * use half precision Former-commit-id: 1304bbea69d8c8ca57140017515dee7ae2ee6536	2025-02-11 22:43:09 +08:00
hoshi-hiyouga	88eafd865b	[misc] support export ollama modelfile (#6899 ) * support export ollama modelfile * update config * add system and num ctx Former-commit-id: 8c2af7466f4015f300b51841db11bcd2505ebf20	2025-02-11 19:52:25 +08:00
hoshi-hiyouga	3f7bd98bfa	[data] refactor template (#6896 ) Former-commit-id: f78d5a3eca947ed965ca2f6c87d60441b1a59867	2025-02-11 17:59:25 +08:00
hoshi-hiyouga	808ff89a2d	[data] refactor mm plugin (#6895 ) * refactor plugin * lint Former-commit-id: 1c8dcc3adca4a2e78f514f8bb70573dd1ca08746	2025-02-11 16:34:49 +08:00
HJ	6d7f1299bd	[data] fix qwen_2_5_vl video processing (#6868 ) * fix qwen_2_5_vl video processing * Update mm_plugin.py * Update mm_plugin.py --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 35f326dabdc8e84036296d2e3de1c84c67b8def8	2025-02-11 16:14:50 +08:00
Zhangchi Feng	2047eab723	[data] fix minicpmv plugin (#6890 ) * fix template name * tiny fix * support minicpm-o-2.6 * support inference of minicpmv * update readme * support dpo of minicpmv * update init audio * update init audio * [model]fix image process in minicpmo * fix no mm inputs Former-commit-id: cdd19ccd8cec460606b4545e886e932c1c5c5fe1	2025-02-11 13:30:44 +08:00
HJ	e11b40c344	[data] fix: sharegpt converter (#6879 ) * fix-sharegpt-format * fix --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: ae8f8151ff750839998b50446f127061f240d41a	2025-02-10 21:59:12 +08:00
hoshi-hiyouga	b869506a57	[data] fix mllama collator (#6874 ) Former-commit-id: c694fa3d66651c6ce547fa72c8260c46a406126b	2025-02-09 22:42:25 +08:00
hoshi-hiyouga	72d5b06b08	[test] align test cases (#6865 ) * align test cases * fix function formatter Former-commit-id: a68f5e22d0391c80a9a826dc83967255be572032	2025-02-09 01:03:49 +08:00
hoshi-hiyouga	94726bdc8d	[dataset] add openthought (#6866 ) Former-commit-id: 20c748a4f108c0087f0d85377a4aa99126a0beb0	2025-02-09 00:53:01 +08:00
hoshi-hiyouga	528e06ccaa	fix qwen2vl plugin (#6855 ) Former-commit-id: fd13b7138ab3f4da0a429a327b9d076bcb70b944	2025-02-08 10:59:10 +08:00
Zhangchi Feng	8f401e37f8	[model] support audio (#6701 ) * support qwen2_audio * improve code * lint * fix * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 5eacb5629e4d7733cd992a63747a1335f2c6a929	2025-02-05 04:59:09 +08:00
Yueqi Song	9feb78e7b4	[data] allow thought in function call (#6797 ) * Update template.py * Update template.py * use formatter * fix regex --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 3a31af6e920683ec074da93b1719e29f5d4cffd6	2025-02-05 02:26:23 +08:00
hoshi-hiyouga	c2022431aa	[misc] update license year & fix llama pro (#6814 ) * fix llamapro script * change year Former-commit-id: d9ae594178796994d400a5f207d6499712816f89	2025-02-05 01:53:33 +08:00
Yueqi Song	0817c24c04	[data] fix qwen tool template (#6796 ) * Update tool_utils.py * fix unittest --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 02bb78a792112f5151b3a96ddde2528823855288	2025-02-05 00:02:00 +08:00
Zhangchi Feng	cfb926fb84	[data] fix minicpmv plugin (#6801 ) * fix template name * tiny fix * support minicpm-o-2.6 * support inference of minicpmv * update readme * support dpo of minicpmv * update init audio * update init audio * [model]fix image process in minicpmo Former-commit-id: 8f704c8b6228ef50f828014f85dce67fda868660	2025-02-04 21:20:15 +08:00
hoshi-hiyouga	a28261a866	[model] add mistral small models (#6786 ) Former-commit-id: e5e95c39bc4199fa89c67e34f9adaaa987058744	2025-02-01 04:31:38 +08:00
hoshi-hiyouga	800de98dc8	[model] add qwen2.5 vl models (#6779 ) Former-commit-id: ed46fb4f6194c30060b908092464dded12e5787c	2025-01-31 03:00:29 +08:00
hoshi-hiyouga	e71737351f	[webui] improve webui & reasoning mode (#6778 ) Former-commit-id: 3f17fc0d7163372e0446f1a38792ff761e99b739	2025-01-31 00:09:21 +08:00
hoshi-hiyouga	87d685b59f	[model] support yarn (#6693 ) Former-commit-id: 8c412abc44a4c61b683465e36c6288580d980250	2025-01-18 13:56:09 +08:00
hoshi-hiyouga	5baa3add8c	[misc] update mm plugin (#6691 ) Former-commit-id: 00303338d6927b1fda58b23340a31a8fa009f706	2025-01-17 23:04:26 +08:00
Zhangchi Feng	3607caa2ad	[data] Fix minicpmv/o dpo training (#6657 ) * fix template name * tiny fix * support minicpm-o-2.6 * support inference of minicpmv * update readme * support dpo of minicpmv Former-commit-id: 8d9f47b98047f370637d1c96c2f3440dcc738ef3	2025-01-15 17:30:37 +08:00
hoshi-hiyouga	41a9e231cb	lint (#6641 ) Former-commit-id: 79731ae13ecd17eb8646fb53162c81dddfef3b00	2025-01-14 18:40:07 +08:00
Haian Huang(深度眸)	1bb06e06df	Support InternLM3 Dense 8B Model (#6640 ) * support internlm3 * update * update * update * add hint Former-commit-id: 24ab7ae0944c5f373e9cac60f0332e704824a057	2025-01-14 18:07:27 +08:00
hoshi-hiyouga	d0da6f40b0	[model] fix mllama any image (#6637 ) * fix mllama any image * reorder classes Former-commit-id: 1242a1c4b4a465c06363fdc59302e80e5c4c96e6	2025-01-14 16:47:58 +08:00
Zhangchi Feng	ae32c148d1	Support new features of MiniCPM-V (#6626 ) * fix template name * tiny fix * support minicpm-o-2.6 Former-commit-id: 53034a61c7654358f46916cbc370910fb2aeff3b	2025-01-14 00:26:19 +08:00
hoshi-hiyouga	2a05941b14	[inference] fix stop token for object detection (#6624 ) * fix stop token * update minicpm data pipeline * fix npu qlora examples Former-commit-id: 844919fadaa8a61dfae47020971ea80730b2346f	2025-01-13 21:34:20 +08:00
Zhangchi Feng	73c1c15b62	Fix template name of MiniCPM-V (#6620 ) * fix template name * tiny fix Former-commit-id: 94dea52cef709a7e6f1cdc0b78e83e0422bd65d3	2025-01-13 16:46:48 +08:00
fzc8578	ec552372ba	remove tests Former-commit-id: 51addcd7ab81548a9952064dd8c95a8542252003	2025-01-13 15:08:35 +08:00
fzc8578	4b61610b12	fix style Former-commit-id: 76a36d9acecbf36b6959a14caacfed1d32bcee41	2025-01-13 14:19:38 +08:00
fzc8578	07798e4aad	fix system prompt and tests Former-commit-id: 955efca677b299749f3d40d587ee310951537543	2025-01-13 14:18:06 +08:00
fzc8578	6d6acd0213	add some Former-commit-id: 5ad8ef3ec434f53f6fc494474becb034a3aca0ca	2025-01-11 15:03:20 +08:00
fzc8578	31bfdb08cd	fix format Former-commit-id: 964e18be5a824950164bc7232d35822a8b116d1a	2025-01-11 01:27:40 +08:00
fzc8578	12c83e00fc	add some Former-commit-id: 6233764d18f31365e9ba450408306fad55567ffc	2025-01-11 01:10:24 +08:00
fzc8578	9dc7b6c7ac	adapt to new mllm_param Former-commit-id: 0775b71965863c2618c117726a1046a36d6d85b8	2025-01-11 00:16:34 +08:00
Zhangchi Feng	627548bf7f	Merge branch 'main' into minicpmv Former-commit-id: 8a9c90759feda975faadc5858bd44b7ea116e7fb	2025-01-11 00:01:36 +08:00
hiyouga	dc65ecdf09	refactor mllm param logic Former-commit-id: b895c190945cf5d991cb4e4dea2ae73cc9c8d246	2025-01-10 15:45:48 +00:00
fzc8578	1f3b729a4b	add some Former-commit-id: 58f50b8729083e9ea0fdcf07042b06261670ad57	2025-01-10 23:29:06 +08:00
fzc8578	0aa7ac210f	add some Former-commit-id: 3acd151a0f8efdd230c0b0980550795d204a69f7	2025-01-10 21:25:32 +08:00
fzc8578	40382f1387	fix some Former-commit-id: 1eb7118db3ad6054cfd59d5f16a5d882e40e9057	2025-01-10 20:55:52 +08:00
fzc8578	e63c2df0b1	fix some Former-commit-id: cd5a1a8b9c6eb59d6e95f79573f60ad8668f1942	2025-01-10 20:27:06 +08:00
Zhangchi Feng	8c0a721c4c	Merge branch 'main' into minicpmv Former-commit-id: d8840ae416660e23f1d615ffd404f519360151d9	2025-01-10 20:12:07 +08:00
fzc8578	9e972bc9ec	add some Former-commit-id: fede563aeb716ba5d1e368fd3e1182e4e580d248	2025-01-10 20:01:22 +08:00
hiyouga	867980196e	improve template, add phi4 model Former-commit-id: a785b6796e445a3adba45c5b6947166a2ff99871	2025-01-09 18:27:54 +00:00
hiyouga	647c51a772	improve log Former-commit-id: a6abf375975ffea3d51e1b944c9855b5f62ffac8	2025-01-08 09:56:10 +00:00

351 Commits