LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2025-12-19 05:10:35 +08:00

Author	SHA1	Message	Date
hoshi-hiyouga	64a6fb9b50	[model] add QwQ 32b (#7179 )	2025-03-06 11:58:36 +08:00
Ze-Yi LIN	8ad03258e1	[trainer] fix swanlab callback (#7176 )	2025-03-06 00:33:37 +08:00
hoshi-hiyouga	b4b89b4ff3	[trainer] update config (#7174 )	2025-03-05 23:32:54 +08:00
Ze-Yi LIN	891c487503	[webui] display swanlab exp link (#7089 ) * webui add swanlab link * change callback name * update --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>	2025-02-27 19:40:54 +08:00
Eric Tang	6edd4992d7	[ray] specify ray storage path (#6920 )	2025-02-14 21:55:41 +08:00
Billy Cao	11eac71c13	[trainer] fix gen_kwarg to eval during training (#5451 ) * Correctly pass gen_kwarg to eval during model runs * fix * fix --------- Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>	2025-02-13 02:35:06 +08:00
marko1616	b7fd1e9c00	[trainer] fix llama3.2 vision kto train (#6904 )	2025-02-12 19:09:14 +08:00
hoshi-hiyouga	e1a7c1242c	[data] fix ollama template (#6902 ) * fix ollama template * add meta info * use half precision	2025-02-11 22:43:09 +08:00
hoshi-hiyouga	9184a6e0ed	[misc] support export ollama modelfile (#6899 ) * support export ollama modelfile * update config * add system and num ctx	2025-02-11 19:52:25 +08:00
hoshi-hiyouga	5f38bcaba9	[deps] upgrade vllm (#6857 )	2025-02-08 15:02:28 +08:00
hoshi-hiyouga	e2dc5b952a	[misc] update license year & fix llama pro (#6814 ) * fix llamapro script * change year	2025-02-05 01:53:33 +08:00
hoshi-hiyouga	15357cdad9	[breaking] support transformers 4.48 (#6628 )	2025-01-31 01:36:33 +08:00
yinpu	0f45982bac	fix: avoid redundant normalization in DPO's SFT loss calculation (#6722 )	2025-01-21 13:38:02 +08:00
hoshi-hiyouga	7a04021d04	[optim] clean apollo (#6645 ) * clean apollo code * update readme	2025-01-15 01:42:50 +08:00
zhuHQ	d9189f9f0b	[optim] add support to APOLLO (#6617 )	2025-01-15 00:24:56 +08:00
hoshi-hiyouga	e3e2c8c689	[inference] fix stop token for object detection (#6624 ) * fix stop token * update minicpm data pipeline * fix npu qlora examples	2025-01-13 21:34:20 +08:00
hiyouga	47e17dd689	imporve log	2025-01-08 09:56:10 +00:00
hiyouga	c46675d5e5	fix llamaboard with ray	2025-01-07 09:59:24 +00:00
hiyouga	d8cac6f546	refactor ray integration, support save ckpt	2025-01-07 09:39:10 +00:00
Eric Tang	1e8e7be0a5	run style check	2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi	163ddb680b	drafting ray integration Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>	2025-01-07 08:55:44 +00:00
hiyouga	870f23d7ea	fix #6546	2025-01-07 06:30:44 +00:00
hiyouga	1800f8c72d	fix #6499	2025-01-02 11:28:54 +00:00
hiyouga	6f5bb3b8e5	fix #6482	2024-12-30 06:03:07 +00:00
hiyouga	2719867982	fix #6448	2024-12-27 16:54:39 +00:00
hiyouga	5111cac6f8	support report custom args	2024-12-21 21:42:45 +00:00
hoshi-hiyouga	947e22a4a3	Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab feat: add swanlab for experiment tracking and visualization.	2024-12-21 14:09:33 +08:00
ZeYi Lin	3a7ea2048a	fix: by hiyouga suggestion	2024-12-20 16:43:03 +08:00
ZeYi Lin	5f6dafd70e	feat: ui improve	2024-12-20 11:03:02 +08:00
ZeYi Lin	d0eb64d5e3	fix: bugs	2024-12-19 21:08:16 +08:00
hiyouga	d4c1fda1ad	fix #6391	2024-12-19 12:16:38 +00:00
ZeYi Lin	8c2df41b93	feat: optimize frontend	2024-12-19 19:04:19 +08:00
ZeYi Lin	d5cf87990e	feat: swanlab params	2024-12-19 18:47:27 +08:00
hiyouga	c7cedc7569	support disable shuffling	2024-12-19 08:53:21 +00:00
hiyouga	96f8f103e5	add swanlab	2024-12-19 07:12:31 +00:00
hiyouga	eda76de32b	support control eos, fix #6345	2024-12-17 10:42:05 +00:00
hiyouga	142191e466	fix #6348	2024-12-17 10:06:46 +00:00
hiyouga	2811814fc4	fix mrope	2024-12-12 15:08:17 +00:00
hiyouga	1324d158f9	support batch infer in vllm	2024-12-04 13:50:00 +00:00
hoshi-hiyouga	bd639a137e	Merge pull request #6078 from wtmlon/support-efficient-tokens-calculation support effective tokens calculation on sft/dpo	2024-11-20 13:43:15 +08:00
Ting	40627c601e	code refactor	2024-11-19 20:33:18 +08:00
Ting	f566ecc8d1	update	2024-11-19 19:12:10 +08:00
Ting	ef6e14550d	update	2024-11-19 19:10:07 +08:00
Ting	b9f00286d8	support efficient tokens calculation on sft/dpo	2024-11-19 17:15:47 +08:00
hoshi-hiyouga	dc82821872	fix #6050	2024-11-16 16:11:16 +08:00
hiyouga	4270f7dfb9	fix dpo metrics	2024-11-02 20:59:01 +08:00
hiyouga	c38aa29336	support rank0 logger	2024-11-02 18:31:04 +08:00
hiyouga	93d3b8f43f	update tests	2024-11-02 12:41:44 +08:00
hiyouga	30567a1487	fix incorrect loss value for vlms	2024-10-30 08:56:46 +00:00
hiyouga	23dbe9a099	fix #5749	2024-10-29 13:02:13 +00:00

1 2 3 4

152 Commits