LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-06-21 22:58:58 +08:00

Author	SHA1	Message	Date
hiyouga	90e14a960d	tiny fix	2024-06-11 12:48:53 +08:00
hiyouga	3f24337a8a	tiny fix	2024-06-11 01:04:16 +08:00
hiyouga	91e62a098f	set dev version	2024-06-11 00:50:53 +08:00
hiyouga	2b6ebd6b51	release v0.8.1	2024-06-11 00:44:26 +08:00
hiyouga	a793e8456b	fix #4160 The split heads should be concatenated in dim=2	2024-06-11 00:37:17 +08:00
hiyouga	0012762b04	update evaluator	2024-06-10 23:56:00 +08:00
hiyouga	c907d81667	fix #2666	2024-06-10 21:24:15 +08:00
mMrBun	950e360ca0	Optimize the handling of QWEN2 in scenarios involving multiple tool calls.	2024-06-10 02:00:14 +08:00
mMrBun	6ed0b0c800	Removed unnecessary comments.	2024-06-09 18:25:22 +08:00
mMrBun	0f2609ce19	Merge branch 'hiyouga:main' into main	2024-06-09 18:17:24 +08:00
mMrBun	cb1cbcb293	Implemented the tool_formatter and tool_extractor for glm4 tool_format	2024-06-09 18:16:15 +08:00
hiyouga	972ec9c668	fix llamafactory-cli env	2024-06-08 07:15:45 +08:00
hiyouga	3ac11e77cc	set dev version	2024-06-08 06:46:09 +08:00
hiyouga	5aa4ce4756	release v0.8.0	2024-06-08 05:20:54 +08:00
hiyouga	54cd743ebf	reorganize adapter code	2024-06-08 00:47:23 +08:00
hoshi-hiyouga	cfd62283a9	fix #4139	2024-06-08 00:45:02 +08:00
hiyouga	06e5d136a4	add resume args in webui	2024-06-08 00:22:16 +08:00
hiyouga	8bf9da659c	fix #4137	2024-06-07 19:16:06 +08:00
hiyouga	f8d8690bf4	tiny fix	2024-06-07 05:19:21 +08:00
hiyouga	4489d73ac7	fix ppo trainer save zero3 model accelerator.get_state_dict(ds_model) should be called at all ranks	2024-06-07 05:14:19 +08:00
hiyouga	2702d7e952	fix ppo in trl 0.8.6	2024-06-07 04:48:29 +08:00
hiyouga	f9e818d79c	fix #4120	2024-06-07 04:18:05 +08:00
hiyouga	ccc8b64cc2	update data processors	2024-06-07 04:15:40 +08:00
hoshi-hiyouga	181dbb0d05	Merge pull request #4009 from AlongWY/main supervised packing with greedy knapsack algorithm	2024-06-07 03:48:46 +08:00
hoshi-hiyouga	c09ad8bab3	Update supervised.py	2024-06-07 03:42:08 +08:00
hoshi-hiyouga	788e8232fc	Update supervised.py	2024-06-07 03:38:23 +08:00
hoshi-hiyouga	8cecade708	Update supervised.py	2024-06-07 03:38:04 +08:00
hiyouga	8e95648850	add qwen2 models	2024-06-07 00:22:57 +08:00
hiyouga	74f96efef9	rename files	2024-06-07 00:09:06 +08:00
hiyouga	45d8be8f93	add DISABLE_TORCHRUN option	2024-06-06 23:44:58 +08:00
hoshi-hiyouga	55c18c49b0	Merge pull request #4082 from MengqingCao/bugfix Fix #4077	2024-06-06 23:38:40 +08:00
hoshi-hiyouga	751dd77bc0	Update cli.py	2024-06-06 23:38:09 +08:00
hiyouga	76c61905b2	fix ppo+zero3 #3108	2024-06-06 23:30:07 +08:00
hiyouga	451b6693c0	fix torch gc	2024-06-06 20:30:25 +08:00
hiyouga	149610c636	fix ppo dataset bug #4012	2024-06-06 19:03:20 +08:00
hiyouga	fad2591e31	update trainers	2024-06-06 18:45:49 +08:00
hiyouga	67aa78cde0	fix base64 image read #4061	2024-06-06 17:29:19 +08:00
hiyouga	cae4737907	lora modules: all by default	2024-06-06 03:53:28 +08:00
hiyouga	c23cc63d3d	add codestral 22B	2024-06-06 03:42:50 +08:00
hiyouga	7daf8366db	lint	2024-06-06 03:33:44 +08:00
hoshi-hiyouga	f2580ad403	Merge pull request #4066 from injet-zhou/main add throughput entry to training log	2024-06-06 03:32:04 +08:00
hoshi-hiyouga	ca459f67eb	Merge pull request #4080 from MengqingCao/npu Add npu option for model exporting	2024-06-06 03:15:44 +08:00
hoshi-hiyouga	feaee36c46	Update export.py	2024-06-06 03:14:46 +08:00
hoshi-hiyouga	af2c3cbee4	Update model_args.py	2024-06-06 03:14:23 +08:00
hoshi-hiyouga	0e740aa463	Merge pull request #4053 from hzhaoy/feature/add_select_config_file Support selecting saved configuration files	2024-06-06 03:06:03 +08:00
hiyouga	8fcc79e1e6	add vllm_dtype arg #3387 #3717	2024-06-06 02:53:27 +08:00
hiyouga	a12a506c3d	support train from scratch #4033 #4075	2024-06-06 02:43:19 +08:00
hiyouga	946f601136	support image input in api #3971 #4061	2024-06-06 02:29:55 +08:00
hiyouga	dc4a00dd63	update train hparams	2024-06-06 01:49:20 +08:00
hiyouga	d4908d5708	add llamafactory-cli env	2024-06-06 01:28:14 +08:00

1 2 3 4 5 ...

1279 Commits