LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-03-16 00:55:59 +08:00

Author	SHA1	Message	Date
hoshi-hiyouga	1a7c872c14	[deps] upgrade transformers to 4.50.0 (#7437) * upgrade transformers * fix hf cache * fix dpo trainer	2025-03-23 17:44:27 +08:00
Eric Tang	8f09c0bf96	[3rdparty] fix redundant process group destroy for ray (#7395) * fix redundant process group destroy for ray * Update tuner.py --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>	2025-03-21 10:56:47 +08:00
hoshi-hiyouga	1b1964714e	[misc] update format (#7277)	2025-03-13 02:53:08 +08:00
hoshi-hiyouga	efa86e730c	[misc] upgrade format to py39 (#7256)	2025-03-12 00:08:41 +08:00
hiyouga	3722d04db1	release v0.9.2 Former-commit-id: e7ed1782d4a006400de6fc0f864abd01f7fadeea	2025-03-11 14:49:13 +08:00
Ze-Yi LIN	1358ad9afd	[tracking] add swanlab_logdir param (#7219) * feat: add swanlab_logdir param * fix Former-commit-id: 9215ad488b6ac6cd57fe8fa4acdacceb63f68ca5	2025-03-11 00:53:07 +08:00
hoshi-hiyouga	8c7917d1a2	[data] fix loader (#7207) * fix dataloader * add test case * fix type * fix ci * fix ci * fix ci * disable overwrite cache in ci Former-commit-id: e84af0e140b1aafd1a6d6fe185a8e41c8fc5f831	2025-03-07 17:20:46 +08:00
hoshi-hiyouga	63e4b14565	[misc] fix cli (#7204) Former-commit-id: 999f57133ca163c7108d2d5ee8194eca9b2109b4	2025-03-07 15:01:18 +08:00
hoshi-hiyouga	25546b9afe	[model] add QwQ 32b (#7179) Former-commit-id: 8897e48b8cd55407812453ddd4ff98ac7bdc4e91	2025-03-06 11:58:36 +08:00
Ze-Yi LIN	754dbb8b07	[trainer] fix swanlab callback (#7176) Former-commit-id: 6d9acf4bd30db24499118aee16bd19cb19ba9e3d	2025-03-06 00:33:37 +08:00
hoshi-hiyouga	54bcc37f55	[trainer] update config (#7174) Former-commit-id: 9f535d0e3c4ee3cd0f1b65218c2eee5d03f43c6f	2025-03-05 23:32:54 +08:00
Ze-Yi LIN	b531afb74e	[webui] display swanlab exp link (#7089) * webui add swanlab link * change callback name * update --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 27a4b93871c63b839c92940766bd7e0177972c9b	2025-02-27 19:40:54 +08:00
Eric Tang	413aa5944a	[ray] specify ray storage path (#6920) Former-commit-id: 4be6b66b1eaa79955e936ce2b747a8837ecd1e49	2025-02-14 21:55:41 +08:00
Billy Cao	680648098b	[trainer] fix gen_kwarg to eval during training (#5451) * Correctly pass gen_kwarg to eval during model runs * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 845d16122496311e08263610a6a922f82604de7b	2025-02-13 02:35:06 +08:00
marko1616	a23e16ae89	[trainer] fix llama3.2 vision kto train (#6904) Former-commit-id: 1563e89adc8988fc6e4250634a3f1e385979b0e5	2025-02-12 19:09:14 +08:00
hoshi-hiyouga	c5649d7149	[data] fix ollama template (#6902) * fix ollama template * add meta info * use half precision Former-commit-id: 1304bbea69d8c8ca57140017515dee7ae2ee6536	2025-02-11 22:43:09 +08:00
hoshi-hiyouga	ca5cd8276c	[misc] support export ollama modelfile (#6899) * support export ollama modelfile * update config * add system and num ctx Former-commit-id: 8c2af7466f4015f300b51841db11bcd2505ebf20	2025-02-11 19:52:25 +08:00
hoshi-hiyouga	c322512037	[deps] upgrade vllm (#6857) Former-commit-id: 4bd50f65a3d62528768561019fda2723d045c7fd	2025-02-08 15:02:28 +08:00
hoshi-hiyouga	40b6e9045d	[misc] update license year & fix llama pro (#6814) * fix llamapro script * change year Former-commit-id: d9ae594178796994d400a5f207d6499712816f89	2025-02-05 01:53:33 +08:00
hoshi-hiyouga	46068b3324	[breaking] support transformers 4.48 (#6628) Former-commit-id: f154ab175c513a4d7bb866bf2cffc34b77b50508	2025-01-31 01:36:33 +08:00
yinpu	5062b099f7	fix: avoid redundant normalization in DPO's SFT loss calculation (#6722) Former-commit-id: 971a8ccbdacf130763d40c7ef82a711b2fc1292f	2025-01-21 13:38:02 +08:00
hoshi-hiyouga	33d420bbcc	[optim] clean apollo (#6645) * clean apollo code * update readme Former-commit-id: 38b8ec4a99189483124b54df9d6bc6b0d318855a	2025-01-15 01:42:50 +08:00
zhuHQ	9b29a431db	[optim] add support to APOLLO (#6617) Former-commit-id: 5a252e5a458457adbd19da3b68a3897ad2962824	2025-01-15 00:24:56 +08:00
hoshi-hiyouga	7ab274eb67	[inference] fix stop token for object detection (#6624) * fix stop token * update minicpm data pipeline * fix npu qlora examples Former-commit-id: 844919fadaa8a61dfae47020971ea80730b2346f	2025-01-13 21:34:20 +08:00
hiyouga	760dea0787	improve log Former-commit-id: a6abf375975ffea3d51e1b944c9855b5f62ffac8	2025-01-08 09:56:10 +00:00
hiyouga	bd61102b5e	fix llamaboard with ray Former-commit-id: bd8a432d6a980b1b24a551626304fe3d394b1baf	2025-01-07 09:59:24 +00:00
hiyouga	708e899769	refactor ray integration, support save ckpt Former-commit-id: 2f50b27e608b2092bfceab6c6e84e6631e973ee2	2025-01-07 09:39:10 +00:00
Eric Tang	88e9badcbb	run style check Former-commit-id: 5ec33baf5f95df9fa2afe5523c825d3eda8a076b	2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi	09a17b5415	drafting ray integration Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com> Former-commit-id: 19c12ddae9350f6e25a270fe3372f5b9094cf960	2025-01-07 08:55:44 +00:00
hiyouga	8d1b77cd6f	fix #6546 Former-commit-id: 6fcf2f10faf3b1614896b091591eeef96d717e64	2025-01-07 06:30:44 +00:00
hiyouga	9e7a7c5651	fix #6499 Former-commit-id: dffc607220ff6dac15cf501ac9a3cdbe80c25211	2025-01-02 11:28:54 +00:00
hiyouga	92c6c384cf	fix #6482 Former-commit-id: 8577f52b4152efe6cc7a8b5f6d37b4f9ba6684e7	2024-12-30 06:03:07 +00:00
hiyouga	c555a83ec9	fix #6448 Former-commit-id: 04f78e85af5af14b4c195936623e426a6a128af2	2024-12-27 16:54:39 +00:00
hiyouga	c57fbebd55	support report custom args Former-commit-id: d41254c40a1c5cacf9377096adb27efa9bdb79ea	2024-12-21 21:42:45 +00:00
hoshi-hiyouga	da8a72d611	Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab feat: add swanlab for experiment tracking and visualization. Former-commit-id: e65fe507f7643bf40b0fc462805c7b7f8ef6b738	2024-12-21 14:09:33 +08:00
ZeYi Lin	6d13503867	fix: by hiyouga suggestion Former-commit-id: 41195f1bc69e4b5da7a265369d368b06754362cf	2024-12-20 16:43:03 +08:00
ZeYi Lin	9d27de776c	feat: ui improve Former-commit-id: 6a1effb1741a13ae5238b0e9b429b4cbe3b6534f	2024-12-20 11:03:02 +08:00
ZeYi Lin	87a8d25f76	fix: bugs Former-commit-id: a2297f97f7587c77d55fbce9ffa81dc60d0b04a1	2024-12-19 21:08:16 +08:00
hiyouga	b58c350c1a	fix #6391 Former-commit-id: 067ba6e6cb4d8a1d95bba0a108f73008416a2865	2024-12-19 12:16:38 +00:00
ZeYi Lin	768914653e	feat: optimize frontend Former-commit-id: 4a78603c141d9bd78bcaf81261b443cf082bf51f	2024-12-19 19:04:19 +08:00
ZeYi Lin	ec2bee271d	feat: swanlab params Former-commit-id: 761b3bdb03e27826fde2ca86d4e37b53c2bbc777	2024-12-19 18:47:27 +08:00
hiyouga	67479ce5d9	support disable shuffling Former-commit-id: 9d8c35fd6b838ede0bd6827c6c6121f2cba2b11b	2024-12-19 08:53:21 +00:00
hiyouga	cf10c2dff8	add swanlab Former-commit-id: c85a77c8a8824a56a67d56b97b4877fcd6edeb3d	2024-12-19 07:12:31 +00:00
hiyouga	93e266e483	support control eos, fix #6345 Former-commit-id: cb0f8399356bf372f3b7963f2565c3d504be0923	2024-12-17 10:42:05 +00:00
hiyouga	90b62807fb	fix #6348 Former-commit-id: 83e552320909f4775377889f1512994b7e638a7e	2024-12-17 10:06:46 +00:00
hiyouga	b57560aa25	fix mrope Former-commit-id: 55bee1d333549ca19858b3f5c1b7b86926e5fb09	2024-12-12 15:08:17 +00:00
hiyouga	51b18e565d	support batch infer in vllm Former-commit-id: 3ef5ed3b9a44eed2f7e3ff221dfc343d0a97c0b5	2024-12-04 13:50:00 +00:00
hoshi-hiyouga	874cdc101b	Merge pull request #6078 from wtmlon/support-efficient-tokens-calculation support effective tokens calculation on sft/dpo Former-commit-id: d0510e6d49b43c5ffadd8af653c3bdecc1582417	2024-11-20 13:43:15 +08:00
Ting	1c8514dc8d	code refactor Former-commit-id: ee3f85aa9677d0aeecb3bc396530d2cd7c50dce5	2024-11-19 20:33:18 +08:00
Ting	befe4310f0	update Former-commit-id: 516ed0ea5fed8c74fe3669a7e85dd89b5a0ec3c2	2024-11-19 19:12:10 +08:00

160 Commits