LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-06-18 13:18:57 +08:00

Author	SHA1	Message	Date
hoshi-hiyouga	788e8232fc	Update supervised.py	2024-06-07 03:38:23 +08:00
hoshi-hiyouga	8cecade708	Update supervised.py	2024-06-07 03:38:04 +08:00
ylfeng	b47e317447	remove empty line	2024-05-31 21:43:08 +08:00
ylfeng	84aee57901	fix eos	2024-05-31 21:40:41 +08:00
ylfeng	f9db439cb7	supervised packing with greedy knapsack algorithm	2024-05-31 15:33:54 +08:00
hoshi-hiyouga	483eb47e5d	Merge pull request #3829 from seanzhang-zhichen/add_dataset_sample_num Add dataset sample num	2024-05-30 00:25:45 +08:00
hoshi-hiyouga	ca5dd7c6c1	Update loader.py	2024-05-30 00:20:20 +08:00
hoshi-hiyouga	f9a88b89ca	Update loader.py	2024-05-30 00:17:21 +08:00
hoshi-hiyouga	b55fb611c5	Update loader.py	2024-05-30 00:12:12 +08:00
hoshi-hiyouga	51dd454337	Update parser.py	2024-05-30 00:05:20 +08:00
hiyouga	8070871732	better llamaboard * easily resume from checkpoint * support full and freeze checkpoints * faster ui	2024-05-29 23:55:38 +08:00
hiyouga	d0aa36b8ad	fix cohere system	2024-05-29 20:58:23 +08:00
hiyouga	0930f58699	fix #3965	2024-05-29 20:55:51 +08:00
hiyouga	89ca832740	update readme	2024-05-29 18:39:11 +08:00
hzhaoy	0dd632fe9e	add TeleChat-12B/TeleChat-12B-v2 models	2024-05-29 15:00:37 +08:00
hiyouga	97346c1d3d	fix hf chat engine	2024-05-29 01:20:07 +08:00
hiyouga	e4b420c146	add ds config to webui	2024-05-29 01:13:17 +08:00
hiyouga	65cd8bdbdb	10x generate in ppo w/ zero3 https://github.com/huggingface/trl/pull/1483	2024-05-29 00:23:23 +08:00
hiyouga	7c8e01bb74	update dpo, kto trainer	2024-05-29 00:14:29 +08:00
hiyouga	900e1ea622	clean kto trainer	2024-05-28 21:43:26 +08:00
hiyouga	1e80a3a638	bump vllm version to 0.4.1	2024-05-28 21:27:27 +08:00
hiyouga	087b9faa39	update readme	2024-05-28 19:35:52 +08:00
hiyouga	7c016b22aa	support DDP in webui	2024-05-28 19:24:22 +08:00
Yimi81	dc07413e7d	fix yi template	2024-05-27 13:11:25 +00:00
hiyouga	c1fdf81df6	tiny fix	2024-05-27 20:54:26 +08:00
hoshi-hiyouga	87ea0a8bcd	Merge pull request #3921 from gusye1234/main Add openchat-3.6-8B support	2024-05-27 20:52:37 +08:00
hoshi-hiyouga	f1002b9f93	Update template.py	2024-05-27 20:51:56 +08:00
hoshi-hiyouga	122213a7a7	Update template.py	2024-05-27 20:51:26 +08:00
Jianbai Ye	cff815391f	add openchat-3.6-8B support	2024-05-27 20:42:08 +08:00
hiyouga	08564838bd	fix full/freeze tuning for mllm	2024-05-27 20:37:57 +08:00
hoshi-hiyouga	838f2fb3e4	Merge pull request #3835 from BUAADreamer/main fix some features in llava-style training	2024-05-27 20:23:45 +08:00
hiyouga	e626e26446	support Aya23	2024-05-27 20:23:24 +08:00
BUAADreamer	ea2afd429e	Merge branch 'hiyouga:main' into main	2024-05-27 19:00:48 +08:00
BUAADreamer	57eb13b75d	add regex of only tune lm and mm_proj	2024-05-27 18:59:00 +08:00
hiyouga	efa4b196ca	add phi-3 7b/14b, mistral v0.3 models	2024-05-27 18:20:16 +08:00
hiyouga	5581cb2e4e	update readme	2024-05-27 18:14:02 +08:00
BUAADreamer	4bc7c10c00	Merge branch 'hiyouga:main' into main	2024-05-27 11:54:01 +08:00
hiyouga	cb63b32986	support SimPO #3900	2024-05-26 23:46:33 +08:00
BUAADreamer	60170a1da4	Merge branch 'hiyouga:main' into main	2024-05-25 14:18:49 +08:00
hiyouga	063f91cc80	fix #3853	2024-05-24 23:29:45 +08:00
seanzhang-zhichen	27cb51f7f8	Merge branch 'main' into add_dataset_sample_num	2024-05-24 15:57:47 +08:00
BUAADreamer	047a06a1e5	Merge branch 'hiyouga:main' into main	2024-05-24 09:50:00 +08:00
hiyouga	3a023bca2a	refactor data preprocessing, fix mllm rlhf	2024-05-24 04:08:25 +08:00
hiyouga	de0e67aff1	fix paligemma sft requires transformers>=4.41.1	2024-05-24 00:23:40 +08:00
hiyouga	67ebc7b388	fix oom issues in export	2024-05-23 23:32:45 +08:00
BUAADreamer	8d53ec2b5f	Merge branch 'hiyouga:main' into main	2024-05-21 22:18:20 +08:00
hiyouga	7134fb02bb	fix paligemma sft	2024-05-21 20:03:09 +08:00
hiyouga	335501e228	fix #3847	2024-05-21 17:53:06 +08:00
BUAADreamer	29a6d5bdb8	support pretraining of llava	2024-05-21 08:57:14 +08:00
hiyouga	2a67457e39	support paligemma	2024-05-21 00:01:22 +08:00

1106 Commits