LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-03-10 13:56:00 +08:00

Author	SHA1	Message	Date
hoshi-hiyouga	4831552856	[infer] set env for vllm ascend (#7745 )	2025-04-17 01:08:55 +08:00
hoshi-hiyouga	0fe5631f9b	[deps] upgrade vllm (#7728 )	2025-04-15 14:57:40 +08:00
hoshi-hiyouga	3ef36d0057	[misc] upgrade cli (#7714 )	2025-04-14 15:41:22 +08:00
Eric Tang	39c1e29ed7	[ray] allow for specifying ray.init kwargs (i.e. runtime_env) (#7647 ) * ray init kwargs * Update trainer_utils.py * fix ray args --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>	2025-04-10 11:31:05 +08:00
hoshi-hiyouga	5817cda37e	[misc] fix packing and eval plot (#7623 )	2025-04-07 18:20:57 +08:00
hoshi-hiyouga	903db09822	[infer] vllm video/audio inference (#7566 )	2025-04-02 02:27:04 +08:00
hoshi-hiyouga	aaf2e6ba2a	[model] fix kv cache (#7564 )	2025-04-01 23:07:46 +08:00
Billy Cao	5d1cc863a4	[data] shard the dataset to allow multiprocessing when streaming is enabled (#7530 ) * Shard the dataset when streaming to allow multiprocessing * Allow user to not set dataset_shards to ensure backward compatibility	2025-04-01 15:36:23 +08:00
Kingsley	185c76f6ad	[model] add Qwen2.5-Omni model (#7537 ) * preserve image_sizes * preserve image_sizes * init plugin * support audio-text2text lora * nit * support image/video-text2text, audio-text2text * remove args * remove lines * add docs && nit * remove some comments * fix && add merge part script * add license	2025-03-31 20:39:35 +08:00
Xu-pixel	f547334604	[3rdparty] support swanlab lark notification (#7481 )	2025-03-27 01:52:01 +08:00
hoshi-hiyouga	dfbe1391e9	[deps] upgrade vllm to 0.8 (#7436 )	2025-03-23 14:32:22 +08:00
Qiaolin Yu	30038d9ce7	[inference] support sglang backend (#7278 ) * Mimic SGLang offline Engine * Add more tests and args * Pass all current tests * Clean Code * fix sample_params * clean code * Fix Stream Chat * change sglang from engine mode to server mode * fix * Fix Review Issues * Use SGLang Built-In Utilities * Fix test SGLang * Some Doc Issue * fix sglang engine * add readme --------- Co-authored-by: Jin Pan <jpan236@wisc.edu> Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>	2025-03-15 04:37:58 +08:00
hoshi-hiyouga	ef5f1c1def	[data] gemma3 plugin pan and scan (#7294 ) * gemma3 pan and scan * add test case * fix test	2025-03-13 23:29:23 +08:00
hoshi-hiyouga	9ccfb97a2c	[misc] update format (#7277 )	2025-03-13 02:53:08 +08:00
hoshi-hiyouga	7c1640ed5f	[misc] upgrade format to py39 (#7256 )	2025-03-12 00:08:41 +08:00
Ze-Yi LIN	0a43bc1960	[tracking] add swanlab_logdir param (#7219 ) * feat: add swanlab_logdir param * fix Former-commit-id: `a1e76af3d9`	2025-03-11 00:53:07 +08:00
hoshi-hiyouga	5a29f49fb1	[config] update args (#7231 ) Former-commit-id: `ed8b12e3cb`	2025-03-10 23:04:43 +08:00
hoshi-hiyouga	4e68828e46	[config] fix export max len (#7230 ) Former-commit-id: `728c2f6819`	2025-03-10 16:46:08 +08:00
hoshi-hiyouga	113cc3d920	[misc] fix cli (#7204 ) Former-commit-id: `bd17223559`	2025-03-07 15:01:18 +08:00
hoshi-hiyouga	e7556b591e	[deps] upgrade vllm (#7183 ) Former-commit-id: `d739fddb10`	2025-03-06 15:25:08 +08:00
hoshi-hiyouga	1f4a0b11ba	[data] update vlm args (#6976 ) Former-commit-id: `3da2cc2710`	2025-02-18 02:12:51 +08:00
hoshi-hiyouga	b1d31ff0f9	[data] add min resolution option (#6975 ) Former-commit-id: `7faecc0301`	2025-02-18 01:40:46 +08:00
Eric Tang	e55ec42d3c	[ray] specify ray storage path (#6920 ) Former-commit-id: `6edd4992d7`	2025-02-14 21:55:41 +08:00
hoshi-hiyouga	036fb0d561	[misc] fix grad ckpt func (#6916 ) Former-commit-id: `e34c3c06da`	2025-02-13 00:17:18 +08:00
hoshi-hiyouga	2e2f6bea07	[data] feat: auto template (#6905 ) * support auto template * add unittest Former-commit-id: `2f8b6847f5`	2025-02-12 00:22:53 +08:00
hoshi-hiyouga	c6be9e242c	[misc] support export ollama modelfile (#6899 ) * support export ollama modelfile * update config * add system and num ctx Former-commit-id: `9184a6e0ed`	2025-02-11 19:52:25 +08:00
hoshi-hiyouga	ff6658ad27	[deps] upgrade vllm (#6857 ) Former-commit-id: `5f38bcaba9`	2025-02-08 15:02:28 +08:00
hoshi-hiyouga	f70208e1c0	[misc] allow extra args (#6831 ) Former-commit-id: `74ade3a176`	2025-02-06 12:38:08 +08:00
Zhangchi Feng	01915eaf40	[model] support audio (#6701 ) * support qwen2_audio * improve code * lint * fix * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: `24c7842948`	2025-02-05 04:59:09 +08:00
hoshi-hiyouga	1fee69f874	[misc] update license year & fix llama pro (#6814 ) * fix llamapro script * change year Former-commit-id: `e2dc5b952a`	2025-02-05 01:53:33 +08:00
hoshi-hiyouga	e8c1979b79	[model] add qwen2.5 vl models (#6779 ) Former-commit-id: `999c7c8fe0`	2025-01-31 03:00:29 +08:00
hoshi-hiyouga	1efe525df7	[model] support yarn (#6693 ) Former-commit-id: `1f47b6186c`	2025-01-18 13:56:09 +08:00
hoshi-hiyouga	bbf334f823	disable valset by default (#6690 ) Former-commit-id: `77bbf65905`	2025-01-17 21:09:30 +08:00
steveepreston	8895cf1152	Update `val_size` english description (#6653 ) * Update `val_size` Description in locales.py * Update `val_size` Description in data_args.py * Remove extra space in data_args.py Former-commit-id: `76675b654e`	2025-01-15 16:00:20 +08:00
hoshi-hiyouga	9ef85f8fc4	[optim] clean apollo (#6645 ) * clean apollo code * update readme Former-commit-id: `7a04021d04`	2025-01-15 01:42:50 +08:00
zhuHQ	763f9b9df0	[optim] add support to APOLLO (#6617 ) Former-commit-id: `d9189f9f0b`	2025-01-15 00:24:56 +08:00
hoshi-hiyouga	5e699458e5	pin vllm version to 0.6.5 (#6629 ) Former-commit-id: `1c7663d304`	2025-01-14 02:44:02 +08:00
hiyouga	c89d17ab63	refactor mllm param logic Former-commit-id: `f6f630a1c9`	2025-01-10 15:45:48 +00:00
hoshi-hiyouga	b777fed171	Merge pull request #6564 from stephen-nju/fix_ray Fix ray Former-commit-id: `6b34b69fa6`	2025-01-08 18:14:18 +08:00
zhubin	014a7ea042	fix get ray args when args not a dict Former-commit-id: `9c4c84828b`	2025-01-08 10:06:02 +00:00
hiyouga	da542fad18	imporve log Former-commit-id: `47e17dd689`	2025-01-08 09:56:10 +00:00
hiyouga	b4174021d6	refactor ray integration, support save ckpt Former-commit-id: `d8cac6f546`	2025-01-07 09:39:10 +00:00
Eric Tang	bba52e258e	run style check Former-commit-id: `1e8e7be0a5`	2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi	1217240918	drafting ray integration Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com> Former-commit-id: `163ddb680b`	2025-01-07 08:55:44 +00:00
hiyouga	813f5919a3	fix #6482 Former-commit-id: `6f5bb3b8e5`	2024-12-30 06:03:07 +00:00
hiyouga	47c2d91933	support report custom args Former-commit-id: `5111cac6f8`	2024-12-21 21:42:45 +00:00
hoshi-hiyouga	547f76e56e	Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab feat: add swanlab for experiment tracking and visualization. Former-commit-id: `947e22a4a3`	2024-12-21 14:09:33 +08:00
ZeYi Lin	67d4757c35	fix: project blank Former-commit-id: `82e5d75014`	2024-12-20 18:26:02 +08:00
ZeYi Lin	cc703b58f5	fix: by hiyouga suggestion Former-commit-id: `3a7ea2048a`	2024-12-20 16:43:03 +08:00
ZeYi Lin	8f786ee938	feat: ui improve Former-commit-id: `5f6dafd70e`	2024-12-20 11:03:02 +08:00

177 Commits