LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2025-12-15 03:10:35 +08:00

Author	SHA1	Message	Date
hiyouga	7924ffc55d	support llama pro #2338 , add rslora	2024-02-15 02:27:36 +08:00
hiyouga	0ae9a16b9d	update gc kwargs	2024-02-07 00:38:24 +08:00
hiyouga	ebf31b62eb	fix #2438	2024-02-06 15:23:08 +08:00
hiyouga	4ecadc3512	fix #2376	2024-02-03 23:14:31 +08:00
hiyouga	521ad76552	fix autoset attn impl, update data readme	2024-01-31 11:58:07 +08:00
hiyouga	2bc30763e9	fix #2320	2024-01-24 16:19:18 +08:00
ldwang	c284665425	Add patch_mixtral_replace_moe_impl for full training Mitral using DeepSpeed Zero3. Signed-off-by: ldwang <ftgreat@gmail.com>	2024-01-24 15:25:31 +08:00
ldwang	18923b1402	Add patch_mixtral_replace_moe_impl for full training Mitral using DeepSpeed Zero3. Signed-off-by: ldwang <ftgreat@gmail.com>	2024-01-24 14:43:16 +08:00
hiyouga	e4ba1deedf	add hint	2024-01-22 23:32:01 +08:00
hoshi-hiyouga	bdc9eff635	Update patcher.py	2024-01-22 23:27:39 +08:00
A-Cepheus	b06a31e76a	🐞 fix: typo	2024-01-22 16:04:39 +08:00
A-Cepheus	319a72b48d	🐞 fix: typo, move MoE fix to patcher	2024-01-22 16:01:58 +08:00
hiyouga	e0a717aa3a	fix #2268	2024-01-21 14:11:38 +08:00
hiyouga	638234ceee	format style	2024-01-20 20:15:56 +08:00
hiyouga	38af076a75	support longlora for main branch	2024-01-20 19:25:22 +08:00
hoshi-hiyouga	bb92cdd0db	Merge pull request #2201 from liu-zichen/token_embed_resize support resize embed for zero3	2024-01-20 17:45:38 +08:00
hiyouga	8cbe4e9609	add upcast_lmhead option	2024-01-19 23:54:25 +08:00
hiyouga	0ff9a1fb4f	set use_reentrant=False	2024-01-19 23:29:54 +08:00
hiyouga	d9f1cae351	support function calling	2024-01-18 09:54:23 +08:00
liuzc	a5f6a7f4fb	support resize embed for zero3	2024-01-16 15:16:20 +08:00
hiyouga	d2a676c8ba	improve model export	2024-01-05 18:51:49 +08:00
hiyouga	f6fdd83f8a	fix #2098	2024-01-05 17:11:26 +08:00
hiyouga	33f2c0d4f8	fix #2081	2024-01-04 23:19:08 +08:00
hiyouga	1696698eb9	fix dispatch	2024-01-03 16:33:16 +08:00
hiyouga	24d8d6f224	fix valuehead patch	2024-01-03 16:19:23 +08:00
hiyouga	55021097d5	fix rm server	2024-01-03 15:30:46 +08:00
hiyouga	e4bb846c43	fix bug	2023-12-24 19:20:12 +08:00
hiyouga	6629087e12	update loader	2023-12-24 19:10:23 +08:00
hiyouga	e44b82ee24	update patcher	2023-12-23 15:24:27 +08:00
hiyouga	7aad0b889d	support unsloth	2023-12-23 00:14:33 +08:00
hiyouga	083355fc05	fix ds zero3 check	2023-12-21 01:19:22 +08:00
hiyouga	624cc21281	improve quantization	2023-12-20 18:27:16 +08:00
hiyouga	c4a3977ad7	add max_memory for gptq #1923	2023-12-20 18:15:17 +08:00
hiyouga	71389be37c	support autogptq in llama board #246	2023-12-16 16:31:30 +08:00
hiyouga	3524aa1e58	support quantization in export model	2023-12-15 23:44:50 +08:00
hiyouga	2740aa9cbb	add configurer	2023-12-15 21:46:40 +08:00

36 Commits