LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-06-21 06:38:54 +08:00

Author	SHA1	Message	Date
hoshi-hiyouga	326f180397	Merge pull request #3987 from injet-zhou/main Fix cann't interrupt training when using multi GPUs in webui Former-commit-id: `ae18e1e251`	2024-06-04 00:04:07 +08:00
hiyouga	e2920aa925	fix #4043 Former-commit-id: `79784ebeb6`	2024-06-03 23:30:37 +08:00
hiyouga	6f7b6ae0c3	remove gc warnings in DPO&KTO Former-commit-id: `f9a206509e`	2024-06-03 22:53:54 +08:00
hoshi-hiyouga	5d96cf146e	Update trainer.py Former-commit-id: `24499f40dc`	2024-06-03 22:08:38 +08:00
enji.zhou	e58aca0602	fix KTO Trainer Sampler Former-commit-id: `34a2c5087a`	2024-06-03 21:32:38 +08:00
hoshi-hiyouga	f6f1c4eacb	Merge pull request #4006 from Uminosachi/scheduler-kwargs Set scheduler_specific_kwargs to get_scheduler Former-commit-id: `0f01500b68`	2024-06-03 19:27:53 +08:00
hiyouga	e4ce59243b	fix #4005 #4013 Former-commit-id: `eed33862bc`	2024-06-03 19:12:29 +08:00
hoshi-hiyouga	eaab09fccb	Merge pull request #4007 from xu-song/patch-3 Update model_args.py Former-commit-id: `1539c72b94`	2024-06-03 18:54:37 +08:00
hiyouga	d0ceb1b091	fix #4022 Former-commit-id: `24e1c0e2ee`	2024-06-03 18:38:36 +08:00
hiyouga	af7748139a	bump versions transformers 4.37.2->4.41.2 datasets 2.14.3->2.16.0 accelerate 0.27.2->0.30.1 peft 0.10.0->0.11.1 trl 0.8.1->0.8.6 Former-commit-id: `876bc92865`	2024-06-03 18:29:38 +08:00
hiyouga	64d24842fe	fix data loader hint Former-commit-id: `49b1e88e3d`	2024-06-03 18:28:27 +08:00
ylfeng	62d55b71a3	remove empty line Former-commit-id: `b47e317447`	2024-05-31 21:43:08 +08:00
ylfeng	0feb2ad35c	fix eos Former-commit-id: `84aee57901`	2024-05-31 21:40:41 +08:00
ylfeng	8350e508d3	supervised packing with greedy knapsack algorithm Former-commit-id: `f9db439cb7`	2024-05-31 15:33:54 +08:00
Xu Song	abe33220bf	Update model_args.py Former-commit-id: `dade2f083d`	2024-05-31 14:35:48 +08:00
Uminosachi	0de4e1e9e2	Set scheduler_specific_kwargs to get_scheduler Former-commit-id: `14e97dc119`	2024-05-31 13:45:39 +09:00
faddddeout	64976e426c	fix cann't interrupt training when using multi GPUs in webui Former-commit-id: `b13d03946e`	2024-05-30 08:39:21 +00:00
hoshi-hiyouga	9b6bdf9449	Merge pull request #3829 from seanzhang-zhichen/add_dataset_sample_num Add dataset sample num Former-commit-id: `483eb47e5d`	2024-05-30 00:25:45 +08:00
hoshi-hiyouga	7b83c550ab	Update loader.py Former-commit-id: `ca5dd7c6c1`	2024-05-30 00:20:20 +08:00
hoshi-hiyouga	9fc713da89	Update loader.py Former-commit-id: `f9a88b89ca`	2024-05-30 00:17:21 +08:00
hoshi-hiyouga	c0f11a280e	Update loader.py Former-commit-id: `b55fb611c5`	2024-05-30 00:12:12 +08:00
hoshi-hiyouga	69a51cacb1	Update parser.py Former-commit-id: `51dd454337`	2024-05-30 00:05:20 +08:00
hiyouga	820404946e	better llamaboard * easily resume from checkpoint * support full and freeze checkpoints * faster ui Former-commit-id: `8070871732`	2024-05-29 23:55:38 +08:00
hiyouga	19a3262387	fix cohere system Former-commit-id: `d0aa36b8ad`	2024-05-29 20:58:23 +08:00
hiyouga	c05cb3769f	fix #3965 Former-commit-id: `0930f58699`	2024-05-29 20:55:51 +08:00
hiyouga	a71a6a05c3	update readme Former-commit-id: `89ca832740`	2024-05-29 18:39:11 +08:00
hzhaoy	ce1be3da4b	add TeleChat-12B/TeleChat-12B-v2 models Former-commit-id: `0dd632fe9e`	2024-05-29 15:00:37 +08:00
hiyouga	05277ee864	fix hf chat engine Former-commit-id: `97346c1d3d`	2024-05-29 01:20:07 +08:00
hiyouga	13e7b64641	add ds config to webui Former-commit-id: `e4b420c146`	2024-05-29 01:13:17 +08:00
hiyouga	468d0e7ed1	10x generate in ppo w/ zero3 https://github.com/huggingface/trl/pull/1483 Former-commit-id: `65cd8bdbdb`	2024-05-29 00:23:23 +08:00
hiyouga	bfac965f9c	update dpo, kto trainer Former-commit-id: `7c8e01bb74`	2024-05-29 00:14:29 +08:00
hiyouga	14f6cc2b7c	clean kto trainer Former-commit-id: `900e1ea622`	2024-05-28 21:43:26 +08:00
hiyouga	87e71df597	bump vllm version to 0.4.1 Former-commit-id: `1e80a3a638`	2024-05-28 21:27:27 +08:00
hiyouga	3152c7dd1c	update readme Former-commit-id: `087b9faa39`	2024-05-28 19:35:52 +08:00
hiyouga	3ea8f5e6b9	support DDP in webui Former-commit-id: `7c016b22aa`	2024-05-28 19:24:22 +08:00
Yimi81	7324984127	fix yi template Former-commit-id: `dc07413e7d`	2024-05-27 13:11:25 +00:00
hiyouga	0706dbf7e6	tiny fix Former-commit-id: `c1fdf81df6`	2024-05-27 20:54:26 +08:00
hoshi-hiyouga	ad3ca3f556	Merge pull request #3921 from gusye1234/main Add openchat-3.6-8B support Former-commit-id: `87ea0a8bcd`	2024-05-27 20:52:37 +08:00
hoshi-hiyouga	eceec1d7fd	Update template.py Former-commit-id: `f1002b9f93`	2024-05-27 20:51:56 +08:00
hoshi-hiyouga	b7b8223230	Update template.py Former-commit-id: `122213a7a7`	2024-05-27 20:51:26 +08:00
Jianbai Ye	d2c1df7f3d	add openchat-3.6-8B support Former-commit-id: `cff815391f`	2024-05-27 20:42:08 +08:00
hiyouga	b88ecd71fd	fix full/freeze tuning for mllm Former-commit-id: `08564838bd`	2024-05-27 20:37:57 +08:00
hoshi-hiyouga	605e70d0e1	Merge pull request #3835 from BUAADreamer/main fix some features in llava-style training Former-commit-id: `838f2fb3e4`	2024-05-27 20:23:45 +08:00
hiyouga	fc5a6b5c4e	support Aya23 Former-commit-id: `e626e26446`	2024-05-27 20:23:24 +08:00
BUAADreamer	5632ba3fa8	Merge branch 'hiyouga:main' into main Former-commit-id: `ea2afd429e`	2024-05-27 19:00:48 +08:00
BUAADreamer	606240aec0	add regex of only tune lm and mm_proj Former-commit-id: `57eb13b75d`	2024-05-27 18:59:00 +08:00
hiyouga	51a1097c64	add phi-3 7b/14b, mistral v0.3 models Former-commit-id: `efa4b196ca`	2024-05-27 18:20:16 +08:00
hiyouga	df33548b39	update readme Former-commit-id: `5581cb2e4e`	2024-05-27 18:14:02 +08:00
BUAADreamer	a6c2a2071d	Merge branch 'hiyouga:main' into main Former-commit-id: `4bc7c10c00`	2024-05-27 11:54:01 +08:00
hiyouga	4807c11db8	support SimPO #3900 Former-commit-id: `cb63b32986`	2024-05-26 23:46:33 +08:00

1268 Commits