LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-03-07 04:05:58 +08:00

Author	SHA1	Message	Date
hoshi-hiyouga	42e090d38b	[trainer] fix vlm loss for transformers 4.49 (#7448 )	2025-03-24 10:24:05 +08:00
hoshi-hiyouga	b1b78daf06	[deps] upgrade transformers to 4.50.0 (#7437 ) * upgrade transformers * fix hf cache * fix dpo trainer	2025-03-23 17:44:27 +08:00
hoshi-hiyouga	4a5d0f0ba7	[assets] update wechat (#7361 )	2025-03-18 21:31:09 +08:00
hoshi-hiyouga	1d2131e5cb	[data] fix template (#7349 )	2025-03-17 23:45:20 +08:00
Hertz	a71e685021	[model] support hunyuan 7b (#7317 ) * [Model]supported tencent-hunyuan model * [Model]supported tencent-hunyuan model(fix) * [Model]supported tencent-hunyuan model(fix)	2025-03-15 20:55:24 +08:00
hoshi-hiyouga	ef5f1c1def	[data] gemma3 plugin pan and scan (#7294 ) * gemma3 pan and scan * add test case * fix test	2025-03-13 23:29:23 +08:00
Ritesh Goru	d7d79f7e06	[data] efficient 4d_attention_mask creation in neat_packing (#7272 )	2025-03-13 03:31:12 +08:00
hoshi-hiyouga	9ccfb97a2c	[misc] update format (#7277 )	2025-03-13 02:53:08 +08:00
hoshi-hiyouga	165d3ed084	[model] support gemma3 (#7273 )	2025-03-13 01:35:23 +08:00
hoshi-hiyouga	7c1640ed5f	[misc] upgrade format to py39 (#7256 )	2025-03-12 00:08:41 +08:00
hiyouga	37b844d929	remove exit in preprocess Former-commit-id: `1a800f9993`	2025-03-11 15:08:25 +08:00
hoshi-hiyouga	df63f05b47	[data] fix loader (#7207 ) * fix dataloader * add test case * fix type * fix ci * fix ci * fix ci * disable overwrite cache in ci Former-commit-id: `8c3f9f6747`	2025-03-07 17:20:46 +08:00
ZhangChuanhui	33b4c33279	[data] fix function formatter (#7201 ) Co-authored-by: zhangchuanhui <zhangchal@digitalchina.com> Former-commit-id: `194e3bddb2`	2025-03-07 15:17:23 +08:00
hoshi-hiyouga	2b21c749c1	[data] fix mm template (#7181 ) Former-commit-id: `be66df1f02`	2025-03-06 15:18:32 +08:00
hoshi-hiyouga	6e58115f98	[trainer] update config (#7174 ) Former-commit-id: `b4b89b4ff3`	2025-03-05 23:32:54 +08:00
sirui.li	8dddffa340	[data] fix qwen2audio plugin (#7166 ) * Update pairwise.py [data]Repair multimodal model dpo training * Update pairwise.py [data]repair multimodal model dpo training using deepcopy * Update pairwise.py * Update mm_plugin.py Former-commit-id: `dff4130969`	2025-03-05 18:03:36 +08:00
hoshi-hiyouga	caef0a8937	[data] use bicubic resampler (#7143 ) Former-commit-id: `bc298c60b7`	2025-03-04 00:17:06 +08:00
rabbit	299cd03785	[data] bailing template (#7117 ) * add bailing template * add bailing template * add bailing template --------- Co-authored-by: chengshiwen.csw@antgroup.com <chengshiwen.csw@antgroup.com> Former-commit-id: `049ddf48af`	2025-03-03 15:33:22 +08:00
hoshi-hiyouga	dca5fe14c2	[data] fix mllama (#7053 ) * fix mllama * fix test Former-commit-id: `76314e6ad1`	2025-02-24 22:05:38 +08:00
hoshi-hiyouga	ca78ba964d	[model] add models (#7054 ) * add qwen25vl awq models * add moonlight Former-commit-id: `ec1a1bc118`	2025-02-24 22:05:13 +08:00
Zhangchi Feng	1fcedf9af6	[data] fix MiniCPMV plugin (#6998 ) * fix template * fix bug in messages processing Former-commit-id: `cde479e47a`	2025-02-19 19:36:04 +08:00
hoshi-hiyouga	3fbd4848e8	[version] support transformers 449 (#6982 ) * support transformers 449 * fix mm plugin Former-commit-id: `b00b290c07`	2025-02-18 17:05:40 +08:00
hoshi-hiyouga	184c5d0882	[misc] fix script (#6977 ) Former-commit-id: `cc8c7e762b`	2025-02-18 17:00:46 +08:00
hoshi-hiyouga	1f4a0b11ba	[data] update vlm args (#6976 ) Former-commit-id: `3da2cc2710`	2025-02-18 02:12:51 +08:00
hoshi-hiyouga	b1d31ff0f9	[data] add min resolution option (#6975 ) Former-commit-id: `7faecc0301`	2025-02-18 01:40:46 +08:00
hoshi-hiyouga	a8c9d5663d	[data] fix predict dataset (#6972 ) Former-commit-id: `bdb581c4a8`	2025-02-17 20:29:40 +08:00
Zhangchi Feng	3dc938268c	[data] fix minicpmo template (#6946 ) Former-commit-id: `2faf8aeff8`	2025-02-15 00:37:41 +08:00
hoshi-hiyouga	2baf8bf03d	[misc] fix lora regex (#6944 ) * fix lora regex * fix Former-commit-id: `1ada3ae5a3`	2025-02-14 21:38:43 +08:00
SrWYG	0ad9f7f058	[data] evaluate on each dataset (#5522 ) * [Update] loader.py , evaluate will run separate evaluations on each dataset. `If you pass a dictionary with names of datasets as keys and datasets as values, evaluate will run separate evaluations on each dataset. This can be useful to monitor how training affects other datasets or simply to get a more fine-grained evaluation` seq2seqtrainner support eval_dataset as Dict. * fix format * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `1e35967ae1`	2025-02-13 02:19:03 +08:00
Noah	1adb46875f	[data] improve error handling (#6128 ) * sync from upstream * update * update * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `4c7bfebcf1`	2025-02-13 01:39:41 +08:00
hoshi-hiyouga	1679930e00	[breaking change] refactor data pipeline (#6901 ) * refactor data * rename file Former-commit-id: `617c8ab467`	2025-02-13 00:39:20 +08:00
marko1616	bae934dea3	[trainer] fix llama3.2 vision kto train (#6904 ) Former-commit-id: `b7fd1e9c00`	2025-02-12 19:09:14 +08:00
hoshi-hiyouga	2e2f6bea07	[data] feat: auto template (#6905 ) * support auto template * add unittest Former-commit-id: `2f8b6847f5`	2025-02-12 00:22:53 +08:00
hoshi-hiyouga	197aa3baf4	[data] fix ollama template (#6902 ) * fix ollama template * add meta info * use half precision Former-commit-id: `e1a7c1242c`	2025-02-11 22:43:09 +08:00
hoshi-hiyouga	c6be9e242c	[misc] support export ollama modelfile (#6899 ) * support export ollama modelfile * update config * add system and num ctx Former-commit-id: `9184a6e0ed`	2025-02-11 19:52:25 +08:00
hoshi-hiyouga	2e954d8fd2	[data] refactor template (#6896 ) Former-commit-id: `d1b8aa3835`	2025-02-11 17:59:25 +08:00
hoshi-hiyouga	593acca556	[data] refactor mm plugin (#6895 ) * refactor plugin * lint Former-commit-id: `aca63bfcca`	2025-02-11 16:34:49 +08:00
HJ	188f22d8a7	[data] fix qwen_2_5_vl video processing (#6868 ) * fix qwen_2_5_vl video processing * Update mm_plugin.py * Update mm_plugin.py --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `9153a7bd83`	2025-02-11 16:14:50 +08:00
Zhangchi Feng	5433b318bb	[data] fix minicpmv plugin (#6890 ) * fix template name * tiny fix * support minicpm-o-2.6 * support inference of minicpmv * update readme * support dpo of minicpmv * update init audio * update init audio * [model]fix image process in minicpmo * fix no mm inputs Former-commit-id: `764627645a`	2025-02-11 13:30:44 +08:00
HJ	fe4f4e9758	[data] fix: sharegpt converter (#6879 ) * fix-sharegpt-format * fix --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `0fb44cb3a5`	2025-02-10 21:59:12 +08:00
hoshi-hiyouga	1bb3d17d9e	[data] fix mllama collator (#6874 ) Former-commit-id: `b68199db27`	2025-02-09 22:42:25 +08:00
hoshi-hiyouga	b93333685b	[test] align test cases (#6865 ) * align test cases * fix function formatter Former-commit-id: `f6f3f8d0fc`	2025-02-09 01:03:49 +08:00
hoshi-hiyouga	fcd0f0480d	[dataset] add openthought (#6866 ) Former-commit-id: `1356f9d840`	2025-02-09 00:53:01 +08:00
hoshi-hiyouga	28037c7834	fix qwen2vl plugin (#6855 ) Former-commit-id: `40048ab77a`	2025-02-08 10:59:10 +08:00
Zhangchi Feng	01915eaf40	[model] support audio (#6701 ) * support qwen2_audio * improve code * lint * fix * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `24c7842948`	2025-02-05 04:59:09 +08:00
Yueqi Song	e665e1fed5	[data] allow thought in function call (#6797 ) * Update template.py * Update template.py * use formatter * fix regex --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `a5e943f7bc`	2025-02-05 02:26:23 +08:00
hoshi-hiyouga	1fee69f874	[misc] update license year & fix llama pro (#6814 ) * fix llamapro script * change year Former-commit-id: `e2dc5b952a`	2025-02-05 01:53:33 +08:00
Yueqi Song	8504bde893	[data] fix qwen tool template (#6796 ) * Update tool_utils.py * fix unittest --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `dd6b7d203e`	2025-02-05 00:02:00 +08:00
Zhangchi Feng	85f22d01bf	[data] fix minicpmv plugin (#6801 ) * fix template name * tiny fix * support minicpm-o-2.6 * support inference of minicpmv * update readme * support dpo of minicpmv * update init audio * update init audio * [model]fix image process in minicpmo Former-commit-id: `ab9bd068ef`	2025-02-04 21:20:15 +08:00
hoshi-hiyouga	445d643ef3	[model] add mistral small models (#6786 ) Former-commit-id: `94803d8133`	2025-02-01 04:31:38 +08:00

329 Commits