LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2025-12-17 04:10:36 +08:00

Author	SHA1	Message	Date
hoshi-hiyouga	4d7bb69234	Update constants.py	2024-09-30 16:47:52 +08:00
shing100	3a9569647f	add Exaone3.0 template	2024-09-30 09:18:25 +09:00
hoshi-hiyouga	b257b91cd0	Update constants.py	2024-09-29 23:45:34 +08:00
BUAADreamer	bec1cb8d55	fix constants	2024-09-29 22:40:43 +08:00
BUAADreamer	485fc04716	fix constants	2024-09-29 22:00:01 +08:00
BUAADreamer	65a8923f5a	add more llava-next series template	2024-09-29 21:29:29 +08:00
BUAADreamer	6642cd501d	add llava-next/llava-next-video/video-llava	2024-09-28 00:57:03 +08:00
Zhangchi Feng	900631755b	Merge branch 'hiyouga:main' into main	2024-09-27 18:14:39 +08:00
hoshi-hiyouga	8e5d12c2c4	add modelscope models	2024-09-26 11:22:48 +08:00
marko1616	885a0b77ab	Chore: Support llama3.2.	2024-09-25 16:08:44 -04:00
hoshi-hiyouga	92ef62f502	add qwen2.5 models	2024-09-19 02:07:54 +08:00
BUAADreamer	31259e7e0c	support llava-next(video)	2024-09-10 12:31:53 +08:00
hiyouga	90d6df6222	release v0.9.0 (real)	2024-09-09 01:00:25 +08:00
hiyouga	653fe70acb	fix constants	2024-09-08 23:52:30 +08:00
hiyouga	54b5c4b819	release v0.9.0	2024-09-08 23:43:35 +08:00
hoshi-hiyouga	1274356263	Merge pull request #5372 from LDLINGLINGLING/main 增加了对minicpm3.0的适配'	2024-09-05 21:35:42 +08:00
liudan	3d3fbaaff9	根据代码规范修改了代码	2024-09-05 20:17:55 +08:00
hiyouga	359ef8bb0e	support Yi-Coder models	2024-09-05 03:12:24 +08:00
hiyouga	8cafc7b055	video datasets	2024-09-05 02:04:17 +08:00
liudan	d7ba97be48	增加了对minicpm3.0的适配'	2024-09-04 23:10:05 +08:00
hiyouga	3382317e32	refactor mm training	2024-08-30 02:14:31 +08:00
hiyouga	dc770efb14	add qwen2 math models	2024-08-09 20:20:35 +08:00
hiyouga	b7ca6c8dc1	fix #5048	2024-08-05 23:48:19 +08:00
codingma	dc09d454f2	support gemma-2-2b	2024-08-01 13:45:48 +08:00
hiyouga	1550fe7331	add mistral nemo model	2024-07-24 16:25:53 +08:00
hiyouga	26533c0604	add llama3.1	2024-07-24 16:20:11 +08:00
hiyouga	53b1002fb7	add codegeex4, internlm2.5	2024-07-06 16:16:47 +08:00
hiyouga	6fd6aa4530	fix packing for eager/sdpa attn	2024-07-04 01:52:43 +08:00
hoshi-hiyouga	87d9b2d005	Merge pull request #4224 from chuan298/main Implement efficient packing without cross-contamination attention	2024-07-04 01:18:54 +08:00
hiyouga	cce7083024	update packing	2024-07-04 01:10:55 +08:00
hiyouga	8a6a7b9c8a	update arg name	2024-07-03 23:23:24 +08:00
hiyouga	c47ab6c072	improve rlhf	2024-07-02 22:23:08 +08:00
hzhaoy	57b7c00430	add TeleChat-1B	2024-07-02 17:49:04 +08:00
hoshi-hiyouga	e8e6af2651	Merge branch 'main' into main	2024-07-01 21:01:09 +08:00
hiyouga	6f63050e1b	add Gemma2 models	2024-06-28 01:26:50 +08:00
hiyouga	e507e60638	update readme	2024-06-24 18:22:12 +08:00
ancv	770f75dc83	move configure_packing to llamafactory.model.patcher and fix constants	2024-06-21 00:45:06 +07:00
hiyouga	a233fbc258	add deepseek coder v2 #4346	2024-06-18 22:53:54 +08:00
ancv	238f5c3d99	update packing with sdpa and eager attention mode	2024-06-16 02:25:47 +07:00
hiyouga	572d8bbfdd	add minicpm #4227	2024-06-15 17:58:52 +08:00
hiyouga	d87108daa6	add license	2024-06-15 17:54:33 +08:00
hiyouga	06e5d136a4	add resume args in webui	2024-06-08 00:22:16 +08:00
hiyouga	8e95648850	add qwen2 models	2024-06-07 00:22:57 +08:00
hiyouga	cae4737907	lora modules: all by default	2024-06-06 03:53:28 +08:00
hiyouga	c23cc63d3d	add codestral 22B	2024-06-06 03:42:50 +08:00
hiyouga	f48f5e646e	support glm-4	2024-06-05 15:16:38 +08:00
hiyouga	8070871732	better llamaboard * easily resume from checkpoint * support full and freeze checkpoints * faster ui	2024-05-29 23:55:38 +08:00
hiyouga	89ca832740	update readme	2024-05-29 18:39:11 +08:00
hzhaoy	0dd632fe9e	add TeleChat-12B/TeleChat-12B-v2 models	2024-05-29 15:00:37 +08:00
hiyouga	c1fdf81df6	tiny fix	2024-05-27 20:54:26 +08:00

1 2

63 Commits