LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2025-11-07 22:00:03 +08:00

Author	SHA1	Message	Date
Ben Feuer	05271756d2	[feat] fp8 training (#8960) Co-authored-by: Benjamin Feuer <penfever@gmail.com> Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>	2025-10-01 14:32:53 +08:00
hoshi-hiyouga	b0c8ba73e0	[deps] update to transformers 4.52 (#8125)	2025-05-21 05:16:18 +08:00
Saiya	820ed764c4	[infer] support lora adapter for SGLang backend (#8067)	2025-05-16 23:33:47 +08:00
Kingsley	1dd67eb042	[data] fix internvl plugin (#7817)	2025-04-23 00:58:22 +08:00
hoshi-hiyouga	a62cba3d05	[example] add bash usage (#7794)	2025-04-22 00:25:51 +08:00
flashJd	1302ca39f6	[misc] fix new tokens adding (#7253) Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>	2025-04-21 23:19:02 +08:00
hoshi-hiyouga	5817cda37e	[misc] fix packing and eval plot (#7623)	2025-04-07 18:20:57 +08:00
hoshi-hiyouga	903db09822	[infer] vllm video/audio inference (#7566)	2025-04-02 02:27:04 +08:00
Kingsley	185c76f6ad	[model] add Qwen2.5-Omni model (#7537) * preserve image_sizes * preserve image_sizes * init plugin * support audio-text2text lora * nit * support image/video-text2text, audio-text2text * remove args * remove lines * add docs && nit * remove some comments * fix && add merge part script * add license	2025-03-31 20:39:35 +08:00
Qiaolin Yu	30038d9ce7	[inference] support sglang backend (#7278) * Mimic SGLang offline Engine * Add more tests and args * Pass all current tests * Clean Code * fix sample_params * clean code * Fix Stream Chat * change sglang from engine mode to server mode * fix * Fix Review Issues * Use SGLang Built-In Utilities * Fix test SGLang * Some Doc Issue * fix sglang engine * add readme --------- Co-authored-by: Jin Pan <jpan236@wisc.edu> Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>	2025-03-15 04:37:58 +08:00
hoshi-hiyouga	ef5f1c1def	[data] gemma3 plugin pan and scan (#7294) * gemma3 pan and scan * add test case * fix test	2025-03-13 23:29:23 +08:00
hoshi-hiyouga	9ccfb97a2c	[misc] update format (#7277)	2025-03-13 02:53:08 +08:00
hoshi-hiyouga	7c1640ed5f	[misc] upgrade format to py39 (#7256)	2025-03-12 00:08:41 +08:00
hoshi-hiyouga	5a29f49fb1	[config] update args (#7231) Former-commit-id: ed8b12e3cbdaa85f5bde619081b86717a1f3c5fa	2025-03-10 23:04:43 +08:00
hoshi-hiyouga	113cc3d920	[misc] fix cli (#7204) Former-commit-id: bd1722355944a3618f44a4ac3e8177c34b97dc75	2025-03-07 15:01:18 +08:00
hoshi-hiyouga	1f4a0b11ba	[data] update vlm args (#6976) Former-commit-id: 3da2cc2710c9b13ab450815a92fff14b03251984	2025-02-18 02:12:51 +08:00
hoshi-hiyouga	b1d31ff0f9	[data] add min resolution option (#6975) Former-commit-id: 7faecc0301709326efa21e7a3fdb75fe0a9635c2	2025-02-18 01:40:46 +08:00
hoshi-hiyouga	c6be9e242c	[misc] support export ollama modelfile (#6899) * support export ollama modelfile * update config * add system and num ctx Former-commit-id: 9184a6e0ed7ff5f632c848f861bfa448c4cd06fc	2025-02-11 19:52:25 +08:00
hoshi-hiyouga	e8c1979b79	[model] add qwen2.5 vl models (#6779) Former-commit-id: 999c7c8fe0caf6b837a1bdc2c6a24fafec327cd8	2025-01-31 03:00:29 +08:00
hoshi-hiyouga	1efe525df7	[model] support yarn (#6693) Former-commit-id: 1f47b6186c267de86cbdbd47ba2adbf1f9db7f39	2025-01-18 13:56:09 +08:00
hiyouga	47c2d91933	support report custom args Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4	2024-12-21 21:42:45 +00:00
Yaser Afshar	76ebd62ac1	Add missing key to init_kwargs Former-commit-id: 1c8ad22a5f167bf4e1c845e273583e5cb3a0214e	2024-12-17 12:34:05 +00:00
Yaser Afshar	fe4546a7bb	Add trust_remote_code parameter and remove True - Introduced a new model parameter `trust_remote_code` - Set the default value of `trust_remote_code` to `False` to enhance security Former-commit-id: 09437763267bc7081159a6878cee9652a2b1ddac	2024-12-17 12:25:12 +00:00
hiyouga	64bac4bc7e	support non-reenterent-gc & fix #6358 Former-commit-id: f319da6937964c5068864f7815efbf3847bd4ad8	2024-12-17 11:41:59 +00:00
hiyouga	e99031daa4	fix inputs Former-commit-id: 446441fdb020b5a102480251cb8536dd8b3f8f99	2024-11-23 18:26:02 +00:00
hiyouga	0d18cca0db	add vllm config Former-commit-id: 58ab4579dc81a1dcea2bf5938ba3f3116cecfc76	2024-11-10 21:28:18 +08:00
huniu20	e8e98bb125	add om_hub_token argument Former-commit-id: 7b91be33c9cd8473453716f0c4c6dec924304efc	2024-10-10 17:16:46 +08:00
hiyouga	4df090ff48	fix #5542 Former-commit-id: fe7ffccdb9a45b31e20ab7e88282a75b45504a97	2024-09-30 23:28:55 +08:00
hiyouga	0daee7cb39	support activation offloading via unsloth gc Former-commit-id: fb72a3adb0916232cc9ac9f0c725c02d07b9354c	2024-09-08 01:22:19 +08:00
hiyouga	3aa6a3e45b	add e2e tests Former-commit-id: 94d5b1bd8f49dabeb9e3c53d634cfb3c06b0241d	2024-09-05 21:52:28 +08:00
hiyouga	9df7a26e6b	video datasets Former-commit-id: 8cafc7b055a854f483ad1c67f3d487ffd34b5f89	2024-09-05 02:04:17 +08:00
hiyouga	5ef58eb655	fix #5334 Former-commit-id: 59d2b31e968677263f005f57ae8a56fc758307a7	2024-09-03 19:09:42 +08:00
hiyouga	bfdcc6bacf	add rlhf-v dataset Former-commit-id: 8e49940746c1a6ff910f07dbefbec14af9d0f3c6	2024-09-01 22:57:41 +08:00
hiyouga	f31e7e0dfc	remove visual_inputs, fix qlora Former-commit-id: a025c3df61db154bef13033518903bbf846f4fc8	2024-08-31 00:24:51 +08:00
hiyouga	a83756b5e9	refactor mm training Former-commit-id: 3382317e32f88ed377d3e7759bdeaf0f2559d22a	2024-08-30 02:14:31 +08:00
hiyouga	c765292093	support liger kernel Former-commit-id: 72bc8f01111ad69b92a647b54b4af988515d9c34	2024-08-27 11:20:14 +08:00
hiyouga	ff6fc666c1	update hparams Former-commit-id: 575a02a23d9b41d00ca6291d8a40b5bdb3cbeeec	2024-07-03 23:18:58 +08:00
ancv	7f42932957	move efficient_packing from data_args to model_args Former-commit-id: e8e13b09423dd08a31a3bde8f85833c6e5d43ee5	2024-07-02 18:37:55 +07:00
hiyouga	d2d9fa4abb	support HQQ/EETQ #4113 Former-commit-id: ad144c2265cdee0d23014dbb3d017ea257cb26ed	2024-06-27 00:29:42 +08:00
hiyouga	47651a94a3	fix #4410 Former-commit-id: fca893d73c3d7bbb87a816522f2e1568d3e9c612	2024-06-24 22:34:31 +08:00
stceum	9aa640f27b	Bug Fix: `off` is parsed as `False` in yaml file, changed to `disabled` to avoid this. Former-commit-id: 3ed063f281d1c2563df1b9eb3800543208c9dc16	2024-06-24 20:39:31 +08:00
hiyouga	f25b8626bf	support pissa Former-commit-id: 8c1046d78ac6c8f9429b73617e35e1eccb35138f	2024-06-16 01:08:12 +08:00
hiyouga	96b82ccd4d	use fixture Former-commit-id: 80a9e6bf94cf14fa63e6b6cdf7e1ce13722c8b5e	2024-06-15 20:06:17 +08:00
hiyouga	2946153cea	add license Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56	2024-06-15 17:54:33 +08:00
hiyouga	a3f4925c2c	add test cases Former-commit-id: b27269bd2b52fb9d43cde8a8b7f293099b0127a2	2024-06-15 04:05:54 +08:00
hiyouga	99ce085415	fix lint Former-commit-id: 713fde4259233af645bade7790211064a07a2a6f	2024-06-13 00:48:44 +08:00
hiyouga	5834651c4a	fix #4198 Former-commit-id: 89f2bd8c8c035181927bd530a7ffc733407d674c	2024-06-11 15:38:38 +08:00
hiyouga	4f0ce9be4e	reorganize adapter code Former-commit-id: 54cd743ebfbd296ae9eaf10c33f59e127f451785	2024-06-08 00:47:23 +08:00
hoshi-hiyouga	d31c9c73c7	Merge pull request #4080 from MengqingCao/npu Add npu option for model exporting Former-commit-id: ca459f67eb319c262d5bab882e9cb3535a76eb69	2024-06-06 03:15:44 +08:00
hoshi-hiyouga	d9a372658a	Update model_args.py Former-commit-id: af2c3cbee4372724cf00379d8494f5000dc8a3df	2024-06-06 03:14:23 +08:00

62 Commits