LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-06-22 23:28:57 +08:00

Author	SHA1	Message	Date
hoshi-hiyouga	1f4a0b11ba	[data] update vlm args (#6976 ) Former-commit-id: `3da2cc2710`	2025-02-18 02:12:51 +08:00
hoshi-hiyouga	b1d31ff0f9	[data] add min resolution option (#6975 ) Former-commit-id: `7faecc0301`	2025-02-18 01:40:46 +08:00
hoshi-hiyouga	a8c9d5663d	[data] fix predict dataset (#6972 ) Former-commit-id: `bdb581c4a8`	2025-02-17 20:29:40 +08:00
hoshi-hiyouga	475a355b82	[assets] update wechat (#6963 ) Former-commit-id: `ad0c6c8916`	2025-02-17 15:23:17 +08:00
Zhangchi Feng	3dc938268c	[data] fix minicpmo template (#6946 ) Former-commit-id: `2faf8aeff8`	2025-02-15 00:37:41 +08:00
Eric Tang	e55ec42d3c	[ray] specify ray storage path (#6920 ) Former-commit-id: `6edd4992d7`	2025-02-14 21:55:41 +08:00
hoshi-hiyouga	2baf8bf03d	[misc] fix lora regex (#6944 ) * fix lora regex * fix Former-commit-id: `1ada3ae5a3`	2025-02-14 21:38:43 +08:00
hoshi-hiyouga	13e1b7ee2b	[misc] fix grad ckpt (#6931 ) Former-commit-id: `c31c63b411`	2025-02-13 23:27:51 +08:00
hoshi-hiyouga	cd493b91de	[model] add liger kernel to qwen2_5 vl (#6930 ) * add liger kernel to qwen2_5 vl * fix patch * fix patch Former-commit-id: `797043d29c`	2025-02-13 23:05:54 +08:00
Billy Cao	48173b606c	[trainer] fix gen_kwarg to eval during training (#5451 ) * Correctly pass gen_kwarg to eval during model runs * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `11eac71c13`	2025-02-13 02:35:06 +08:00
SrWYG	0ad9f7f058	[data] evaluate on each dataset (#5522 ) * [Update] loader.py , evaluate will run separate evaluations on each dataset. `If you pass a dictionary with names of datasets as keys and datasets as values, evaluate will run separate evaluations on each dataset. This can be useful to monitor how training affects other datasets or simply to get a more fine-grained evaluation` seq2seqtrainner support eval_dataset as Dict. * fix format * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `1e35967ae1`	2025-02-13 02:19:03 +08:00
Noah	1adb46875f	[data] improve error handling (#6128 ) * sync from upstream * update * update * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `4c7bfebcf1`	2025-02-13 01:39:41 +08:00
hoshi-hiyouga	9b852ebe25	[misc] update readme (#6918 ) Former-commit-id: `8956c93d9b`	2025-02-13 01:01:41 +08:00
hoshi-hiyouga	07aa7b71a3	[misc] update readme (#6917 ) Former-commit-id: `499ea45d1f`	2025-02-13 00:58:10 +08:00
hoshi-hiyouga	1679930e00	[breaking change] refactor data pipeline (#6901 ) * refactor data * rename file Former-commit-id: `617c8ab467`	2025-02-13 00:39:20 +08:00
Eric Tang	d50e04b805	[misc] support for launching LLaMA-Factory with `uv run` (#6907 ) * yay * uv with ray temporary commit * remove ray specific code for now * cleanup Former-commit-id: `f8a206125d`	2025-02-13 00:38:44 +08:00
Eric Tang	e515fe62de	[example] fix path to ray example (#6906 ) Former-commit-id: `ee5fe216dc`	2025-02-13 00:29:32 +08:00
hoshi-hiyouga	036fb0d561	[misc] fix grad ckpt func (#6916 ) Former-commit-id: `e34c3c06da`	2025-02-13 00:17:18 +08:00
marko1616	bae934dea3	[trainer] fix llama3.2 vision kto train (#6904 ) Former-commit-id: `b7fd1e9c00`	2025-02-12 19:09:14 +08:00
hoshi-hiyouga	2e2f6bea07	[data] feat: auto template (#6905 ) * support auto template * add unittest Former-commit-id: `2f8b6847f5`	2025-02-12 00:22:53 +08:00
hoshi-hiyouga	1b02183da9	[misc] update readme (#6903 ) Former-commit-id: `18179a3823`	2025-02-11 22:51:26 +08:00
hoshi-hiyouga	197aa3baf4	[data] fix ollama template (#6902 ) * fix ollama template * add meta info * use half precision Former-commit-id: `e1a7c1242c`	2025-02-11 22:43:09 +08:00
hoshi-hiyouga	c6be9e242c	[misc] support export ollama modelfile (#6899 ) * support export ollama modelfile * update config * add system and num ctx Former-commit-id: `9184a6e0ed`	2025-02-11 19:52:25 +08:00
hoshi-hiyouga	2e954d8fd2	[data] refactor template (#6896 ) Former-commit-id: `d1b8aa3835`	2025-02-11 17:59:25 +08:00
codingma	fafa3add84	support ollama modelfile export (#4686 ) Former-commit-id: `7f354b80bc`	2025-02-11 17:52:24 +08:00
hoshi-hiyouga	593acca556	[data] refactor mm plugin (#6895 ) * refactor plugin * lint Former-commit-id: `aca63bfcca`	2025-02-11 16:34:49 +08:00
HJ	188f22d8a7	[data] fix qwen_2_5_vl video processing (#6868 ) * fix qwen_2_5_vl video processing * Update mm_plugin.py * Update mm_plugin.py --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `9153a7bd83`	2025-02-11 16:14:50 +08:00
hoshi-hiyouga	703bb9cc18	[assets] update wechat (#6892 ) Former-commit-id: `fc5d47401f`	2025-02-11 13:56:26 +08:00
Zhangchi Feng	5433b318bb	[da'ta] fix minicpmv plugin (#6890 ) * fix template name * tiny fix * support minicpm-o-2.6 * support inference of minicpmv * update readme * support dpo of minicpmv * update init audio * update init audio * [model]fix image process in minicpmo * fix no mm inputs Former-commit-id: `764627645a`	2025-02-11 13:30:44 +08:00
HJ	fe4f4e9758	[data] fix: sharegpt converter (#6879 ) * fix-sharegpt-format * fix --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `0fb44cb3a5`	2025-02-10 21:59:12 +08:00
hoshi-hiyouga	1bb3d17d9e	[data] fix mllama collator (#6874 ) Former-commit-id: `b68199db27`	2025-02-09 22:42:25 +08:00
hoshi-hiyouga	b93333685b	[test] align test cases (#6865 ) * align test cases * fix function formatter Former-commit-id: `f6f3f8d0fc`	2025-02-09 01:03:49 +08:00
hoshi-hiyouga	fcd0f0480d	[dataset] add openthought (#6866 ) Former-commit-id: `1356f9d840`	2025-02-09 00:53:01 +08:00
hoshi-hiyouga	ff6658ad27	[deps] upgrade vllm (#6857 ) Former-commit-id: `5f38bcaba9`	2025-02-08 15:02:28 +08:00
hoshi-hiyouga	28037c7834	fix qwen2vl plugin (#6855 ) Former-commit-id: `40048ab77a`	2025-02-08 10:59:10 +08:00
hoshi-hiyouga	f70208e1c0	[misc] allow extra args (#6831 ) Former-commit-id: `74ade3a176`	2025-02-06 12:38:08 +08:00
hoshi-hiyouga	7aa9767dc2	[assets] update wechat (#6830 ) Former-commit-id: `6dad536968`	2025-02-06 12:02:05 +08:00
Zhangchi Feng	01915eaf40	[model] support audio (#6701 ) * support qwen2_audio * improve code * lint * fix * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `24c7842948`	2025-02-05 04:59:09 +08:00
Yueqi Song	e665e1fed5	[data] allow thought in function call (#6797 ) * Update template.py * Update template.py * use formatter * fix regex --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `a5e943f7bc`	2025-02-05 02:26:23 +08:00
hoshi-hiyouga	1fee69f874	[misc] update license year & fix llama pro (#6814 ) * fix llamapro script * change year Former-commit-id: `e2dc5b952a`	2025-02-05 01:53:33 +08:00
Yueqi Song	8504bde893	[data] fix qwen tool template (#6796 ) * Update tool_utils.py * fix unittest --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `dd6b7d203e`	2025-02-05 00:02:00 +08:00
Zhangchi Feng	85f22d01bf	[data] fix minicpmv plugin (#6801 ) * fix template name * tiny fix * support minicpm-o-2.6 * support inference of minicpmv * update readme * support dpo of minicpmv * update init audio * update init audio * [model]fix image process in minicpmo Former-commit-id: `ab9bd068ef`	2025-02-04 21:20:15 +08:00
hoshi-hiyouga	822d5d362c	[assets] update wechat (#6810 ) Former-commit-id: `069a477d16`	2025-02-04 21:17:40 +08:00
neavo	32163e7ce0	[readme] update flash attention installation instruction on win platform (#6788 ) * Update README_zh.md * Update README.md Former-commit-id: `a417bcf8d9`	2025-02-01 12:43:29 +08:00
hoshi-hiyouga	454140d912	[misc] update workflows (#6787 ) Former-commit-id: `b5fda21288`	2025-02-01 04:54:42 +08:00
hoshi-hiyouga	445d643ef3	[model] add mistral small models (#6786 ) Former-commit-id: `94803d8133`	2025-02-01 04:31:38 +08:00
hoshi-hiyouga	e8c1979b79	[model] add qwen2.5 vl models (#6779 ) Former-commit-id: `999c7c8fe0`	2025-01-31 03:00:29 +08:00
hoshi-hiyouga	f6779b0e0c	[breaking] support transformers 4.48 (#6628 ) Former-commit-id: `15357cdad9`	2025-01-31 01:36:33 +08:00
hoshi-hiyouga	245de012ca	[webui] improve webui & reasoning mode (#6778 ) Former-commit-id: `45e68b9f09`	2025-01-31 00:09:21 +08:00
codingma	f143360ee6	[assets] update wechat (#6771 ) Former-commit-id: `4fb6059f48`	2025-01-29 12:31:24 +08:00

1 2 3 4 5 ...

2652 Commits