LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-04-08 21:36:01 +08:00

Author	SHA1	Message	Date
Kingsley	19c49ef284	[model] add arch check for InternVL (#7803 )	2025-04-22 16:38:05 +08:00
hoshi-hiyouga	acf641abc2	[data] improve mmplugin (#7795 )	2025-04-22 01:25:33 +08:00
hoshi-hiyouga	cea9071ed1	[example] add bash usage (#7794 )	2025-04-22 00:25:51 +08:00
flashJd	4d8f459ff6	[misc] fix new tokens adding (#7253 ) Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>	2025-04-21 23:19:02 +08:00
ddddng	0313fbd8b0	[model] fix gemma3 export (#7786 ) Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>	2025-04-21 23:07:11 +08:00
Sachin Beldona	1bae62b773	[misc] fix bug in constant (#7765 ) Co-authored-by: Sachin Beldona <sbeldona@cs.cmu.edu>	2025-04-21 23:06:31 +08:00
hoshi-hiyouga	8208cbf1dc	[trainer] fix pt loss (#7748 ) * fix pt loss * robust * fix * test	2025-04-17 03:15:35 +08:00
hoshi-hiyouga	a0818eae58	[breaking] bump transformers to 4.45.0 & improve ci (#7746 ) * update ci * fix * fix * fix * fix * fix	2025-04-17 02:36:48 +08:00
hoshi-hiyouga	06001ea2f0	[infer] set env for vllm ascend (#7745 )	2025-04-17 01:08:55 +08:00
Kingsley	7a00670f70	[model] support intern-VL 2.5-3 series (#7258 ) * add internvl and rebase * fix for internvl2&3 * remove lines * fix video_inputs & lint * nit * add constants * remove lines * fix * fix error * pass ci * pass ci * skip internvl & nit	2025-04-17 00:31:30 +08:00
Kingsley	d1b695cd9f	[model] Support Kimi_VL thinking/instruct (#7719 ) * add kimi_vl * patch config * check version * Update mm_plugin.py * Update mm_plugin.py --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>	2025-04-15 00:21:58 +08:00
hoshi-hiyouga	2b92e85cdd	[misc] fix env vars (#7715 )	2025-04-14 16:04:04 +08:00
hoshi-hiyouga	8f46aced51	[misc] upgrade cli (#7714 )	2025-04-14 15:41:22 +08:00
Dain Kim	e60249d597	[bugfix] enable_gemma_liger_kernel (#7660 ) - The `enable_liger_kernel` function for the Gemma model series was not executed due to the existing `if` statement in the code. - Changed the line to an `elif` statement so that the `apply_liger_kernel` function is executed properly. resolved: #7628	2025-04-10 11:27:30 +08:00
Kingsley	0935eff188	[data] Fix bugs of `use_audio_in_video` in Qwen2.5 Omni (#7638 ) * cache _mm_inputs * nit * support for use_audio_in_video * remove cache * fix data * Update mllm_video_audio_demo.json	2025-04-08 18:40:10 +08:00
hoshi-hiyouga	fb46193364	[misc] fix packing and eval plot (#7623 )	2025-04-07 18:20:57 +08:00
hoshi-hiyouga	40fb24916f	[model] add llama4 (#7611 )	2025-04-06 13:42:31 +08:00
Kingsley	ac9ba80128	[data] fix qwen2.5 omni plugin (#7573 ) * align key with qwen2vl * nit && change scripts	2025-04-02 21:28:52 +08:00
hoshi-hiyouga	be0289292d	[infer] vllm video/audio inference (#7566 )	2025-04-02 02:27:04 +08:00
hoshi-hiyouga	37d783149d	[model] fix kv cache (#7564 )	2025-04-01 23:07:46 +08:00
Yu Shi Jie	69b0c1cf4f	[model] fix use_cache patching for gemma3 multimodal (#7500 )	2025-04-01 16:06:48 +08:00
Kingsley	1189aeb6c2	[model] add Qwen2.5-Omni model (#7537 ) * preserve image_sizes * preserve image_sizes * init plugin * support audio-text2text lora * nit * support image/video-text2text, audio-text2text * remove args * remove lines * add docs && nit * remove some comments * fix && add merge part script * add license	2025-03-31 20:39:35 +08:00
Xiaosu Zhu	d38c402f63	[misc] update liger-kernel's monkey patch (#7453 ) * Update liger_kernel.py * Update setup.py	2025-03-25 11:58:52 +08:00
AbdelKarim ELJANDOUBI	ce089ef8f6	[misc] enable liger kernel for gemma3 text and paligemma (#7466 ) * add gemma3 text * add paligemma (1,2 and 2 mix)	2025-03-25 09:27:43 +08:00
Kenny Lam	cad8bde6b1	[misc] enable liger kernel for gemma3 (#7462 )	2025-03-24 19:09:59 +08:00
hoshi-hiyouga	e7ae755ab6	[data] gemma3 plugin pan and scan (#7294 ) * gemma3 pan and scan * add test case * fix test	2025-03-13 23:29:23 +08:00
hoshi-hiyouga	1b1964714e	[misc] update format (#7277 )	2025-03-13 02:53:08 +08:00
hoshi-hiyouga	a54c859674	[model] support gemma3 (#7273 )	2025-03-13 01:35:23 +08:00
hoshi-hiyouga	efa86e730c	[misc] upgrade format to py39 (#7256 )	2025-03-12 00:08:41 +08:00
hoshi-hiyouga	c6331546a9	[config] update args (#7231 ) Former-commit-id: f71a901840811bf560df671ec63a146ff99140c6	2025-03-10 23:04:43 +08:00
hoshi-hiyouga	5deefc6094	[data] update vlm args (#6976 ) Former-commit-id: c28e710636a0286d4b8a1d494529b25168a8f3ab	2025-02-18 02:12:51 +08:00
hoshi-hiyouga	b7ccfd28d1	[data] add min resolution option (#6975 ) Former-commit-id: 76bd9a98a2fb00f1a1d881e6e1364c02fd36d327	2025-02-18 01:40:46 +08:00
hoshi-hiyouga	1cda37892e	[misc] fix lora regex (#6944 ) * fix lora regex * fix Former-commit-id: 1d0ecbaee1b72f1e03154ddd4fcc8b7876e01f89	2025-02-14 21:38:43 +08:00
hoshi-hiyouga	6ebe81e04d	[misc] fix grad ckpt (#6931 ) Former-commit-id: deae1fc9a0bea5c8b8be1564cf9c81c9c02a0b3a	2025-02-13 23:27:51 +08:00
hoshi-hiyouga	a9b4e229af	[model] add liger kernel to qwen2_5 vl (#6930 ) * add liger kernel to qwen2_5 vl * fix patch * fix patch Former-commit-id: 828776d155986166498dfc907194f64436571106	2025-02-13 23:05:54 +08:00
hoshi-hiyouga	8cbfa350fd	[misc] fix grad ckpt func (#6916 ) Former-commit-id: 35e069a52b3d7cfd9b0107574b09265eb2290f0b	2025-02-13 00:17:18 +08:00
hoshi-hiyouga	c322512037	[deps] upgrade vllm (#6857 ) Former-commit-id: 4bd50f65a3d62528768561019fda2723d045c7fd	2025-02-08 15:02:28 +08:00
Zhangchi Feng	46a1786595	[model] support audio (#6701 ) * support qwen2_audio * improve code * lint * fix * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 5eacb5629e4d7733cd992a63747a1335f2c6a929	2025-02-05 04:59:09 +08:00
hoshi-hiyouga	40b6e9045d	[misc] update license year & fix llama pro (#6814 ) * fix llamapro script * change year Former-commit-id: d9ae594178796994d400a5f207d6499712816f89	2025-02-05 01:53:33 +08:00
Zhangchi Feng	5f9e4d01bd	[data] fix minicpmv plugin (#6801 ) * fix template name * tiny fix * support minicpm-o-2.6 * support inference of minicpmv * update readme * support dpo of minicpmv * update init audio * update init audio * [model]fix image process in minicpmo Former-commit-id: 8f704c8b6228ef50f828014f85dce67fda868660	2025-02-04 21:20:15 +08:00
hoshi-hiyouga	e335c548c1	[model] add mistral small models (#6786 ) Former-commit-id: e5e95c39bc4199fa89c67e34f9adaaa987058744	2025-02-01 04:31:38 +08:00
hoshi-hiyouga	1132aaa53c	[model] add qwen2.5 vl models (#6779 ) Former-commit-id: ed46fb4f6194c30060b908092464dded12e5787c	2025-01-31 03:00:29 +08:00
hoshi-hiyouga	46068b3324	[breaking] support transformers 4.48 (#6628 ) Former-commit-id: f154ab175c513a4d7bb866bf2cffc34b77b50508	2025-01-31 01:36:33 +08:00
hoshi-hiyouga	87db2a849a	[model] support yarn (#6693 ) Former-commit-id: 8c412abc44a4c61b683465e36c6288580d980250	2025-01-18 13:56:09 +08:00
hoshi-hiyouga	b2f6d001bf	fix qwen2 moe (#6684 ) Former-commit-id: ab624419fa0ab23ef7a331a0ec14e393328772b5	2025-01-17 13:46:09 +08:00
hoshi-hiyouga	33d420bbcc	[optim] clean apollo (#6645 ) * clean apollo code * update readme Former-commit-id: 38b8ec4a99189483124b54df9d6bc6b0d318855a	2025-01-15 01:42:50 +08:00
zhuHQ	9b29a431db	[optim] add support to APOLLO (#6617 ) Former-commit-id: 5a252e5a458457adbd19da3b68a3897ad2962824	2025-01-15 00:24:56 +08:00
hoshi-hiyouga	b51ade6d86	lint (#6641 ) Former-commit-id: 79731ae13ecd17eb8646fb53162c81dddfef3b00	2025-01-14 18:40:07 +08:00
Haian Huang(深度眸)	9b224eb61a	Support InternLM3 Dense 8B Model (#6640 ) * support internlm3 * update * update * update * add hint Former-commit-id: 24ab7ae0944c5f373e9cac60f0332e704824a057	2025-01-14 18:07:27 +08:00
Xiaosu Zhu	1e2b1cedec	Fix tokenizer max length (#6632 ) Former-commit-id: 1807c7ba033985490aa7c8c39d880da6af983b92	2025-01-14 17:35:54 +08:00

1 2 3 4

198 Commits