LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-06-20 14:18:55 +08:00

Author	SHA1	Message	Date
hiyouga	3d72b1a856	fix jinja template Former-commit-id: `2b596fb55f`	2024-06-19 20:03:50 +08:00
hiyouga	7735456561	fix templates Former-commit-id: `4cff6a4ad5`	2024-06-19 17:44:05 +08:00
hiyouga	c9557241f6	fix bug Former-commit-id: `6d2bf216ac`	2024-06-19 03:49:23 +08:00
hiyouga	e73a235a38	use prefix to replace force system Former-commit-id: `4f22eae8f4`	2024-06-19 03:39:52 +08:00
hiyouga	bccc852f76	fix tool formatter, allow parallel function #4362 Former-commit-id: `cd75b1fe9d`	2024-06-19 03:23:51 +08:00
hoshi-hiyouga	6db02615d4	Merge pull request #4173 from mMrBun/main Implemented the tool_formatter and tool_extractor for glm4 and Qwen2 tool_format Former-commit-id: `c0ca42566c`	2024-06-19 03:18:55 +08:00
hiyouga	c0c6b8075a	tiny fix Former-commit-id: `38b6b0f52e`	2024-06-16 01:06:41 +08:00
ancv	9d9f8c6531	remove some unused params Former-commit-id: `04315c3d92`	2024-06-15 23:00:55 +07:00
hiyouga	2946153cea	add license Former-commit-id: `d87108daa6`	2024-06-15 17:54:33 +08:00
hiyouga	8fccaf20c5	fix #4221 Former-commit-id: `6baafd4eb3`	2024-06-13 02:48:21 +08:00
ancv	045eb155a2	implement efficient packing without cross-contamination attention Former-commit-id: `b2c367bc61`	2024-06-12 11:56:01 +07:00
hoshi-hiyouga	bf3de9bfe8	Update pretrain.py Former-commit-id: `0c29233237`	2024-06-11 17:02:14 +08:00
d	da39715085	经过大量的增量预训练，进行对比试验，发现这个bug：llama3在预训练时使用的tokenizer.eos_toke是'<\|end_of_text\|>' ，这里在每条数据后面也得用这个，而不是'<\|eot_id\|>'，否则很容易导致严重的性能下降 Former-commit-id: `6979f3f848`	2024-06-11 16:23:40 +08:00
mMrBun	b6d63b3324	Optimize the handling of QWEN2 in scenarios involving multiple tool calls. Former-commit-id: `950e360ca0`	2024-06-10 02:00:14 +08:00
mMrBun	3f11ab800f	Removed unnecessary comments. Former-commit-id: `6ed0b0c800`	2024-06-09 18:25:22 +08:00
mMrBun	daf472994d	Merge branch 'hiyouga:main' into main Former-commit-id: `0f2609ce19`	2024-06-09 18:17:24 +08:00
mMrBun	18a86ea104	Implemented the tool_formatter and tool_extractor for glm4 tool_format Former-commit-id: `cb1cbcb293`	2024-06-09 18:16:15 +08:00
hiyouga	ce40d12692	release v0.8.0 Former-commit-id: `5aa4ce4756`	2024-06-08 05:20:54 +08:00
hiyouga	c6f5f69644	update data processors Former-commit-id: `ccc8b64cc2`	2024-06-07 04:15:40 +08:00
hoshi-hiyouga	4953ded639	Merge pull request #4009 from AlongWY/main supervised packing with greedy knapsack algorithm Former-commit-id: `181dbb0d05`	2024-06-07 03:48:46 +08:00
hoshi-hiyouga	e3ef239bc0	Update supervised.py Former-commit-id: `c09ad8bab3`	2024-06-07 03:42:08 +08:00
hoshi-hiyouga	fd7bd911a6	Update supervised.py Former-commit-id: `788e8232fc`	2024-06-07 03:38:23 +08:00
hoshi-hiyouga	21df5f0bd0	Update supervised.py Former-commit-id: `8cecade708`	2024-06-07 03:38:04 +08:00
hiyouga	8da149ba40	rename files Former-commit-id: `74f96efef9`	2024-06-07 00:09:06 +08:00
hiyouga	e0aadd4b34	fix ppo dataset bug #4012 Former-commit-id: `149610c636`	2024-06-06 19:03:20 +08:00
hiyouga	94c37490d1	support glm-4 Former-commit-id: `f48f5e646e`	2024-06-05 15:16:38 +08:00
hiyouga	0eff6a66d5	tiny fix Former-commit-id: `5a13b3baa6`	2024-06-04 00:31:10 +08:00
hiyouga	8ecf606230	fix #3992 Former-commit-id: `a18acf2abe`	2024-06-04 00:17:36 +08:00
hiyouga	64d24842fe	fix data loader hint Former-commit-id: `49b1e88e3d`	2024-06-03 18:28:27 +08:00
ylfeng	62d55b71a3	remove empty line Former-commit-id: `b47e317447`	2024-05-31 21:43:08 +08:00
ylfeng	0feb2ad35c	fix eos Former-commit-id: `84aee57901`	2024-05-31 21:40:41 +08:00
ylfeng	8350e508d3	supervised packing with greedy knapsack algorithm Former-commit-id: `f9db439cb7`	2024-05-31 15:33:54 +08:00
hoshi-hiyouga	9b6bdf9449	Merge pull request #3829 from seanzhang-zhichen/add_dataset_sample_num Add dataset sample num Former-commit-id: `483eb47e5d`	2024-05-30 00:25:45 +08:00
hoshi-hiyouga	7b83c550ab	Update loader.py Former-commit-id: `ca5dd7c6c1`	2024-05-30 00:20:20 +08:00
hoshi-hiyouga	9fc713da89	Update loader.py Former-commit-id: `f9a88b89ca`	2024-05-30 00:17:21 +08:00
hoshi-hiyouga	c0f11a280e	Update loader.py Former-commit-id: `b55fb611c5`	2024-05-30 00:12:12 +08:00
hoshi-hiyouga	69a51cacb1	Update parser.py Former-commit-id: `51dd454337`	2024-05-30 00:05:20 +08:00
hiyouga	19a3262387	fix cohere system Former-commit-id: `d0aa36b8ad`	2024-05-29 20:58:23 +08:00
hiyouga	c05cb3769f	fix #3965 Former-commit-id: `0930f58699`	2024-05-29 20:55:51 +08:00
hiyouga	a71a6a05c3	update readme Former-commit-id: `89ca832740`	2024-05-29 18:39:11 +08:00
hzhaoy	ce1be3da4b	add TeleChat-12B/TeleChat-12B-v2 models Former-commit-id: `0dd632fe9e`	2024-05-29 15:00:37 +08:00
Yimi81	7324984127	fix yi template Former-commit-id: `dc07413e7d`	2024-05-27 13:11:25 +00:00
hiyouga	0706dbf7e6	tiny fix Former-commit-id: `c1fdf81df6`	2024-05-27 20:54:26 +08:00
hoshi-hiyouga	eceec1d7fd	Update template.py Former-commit-id: `f1002b9f93`	2024-05-27 20:51:56 +08:00
hoshi-hiyouga	b7b8223230	Update template.py Former-commit-id: `122213a7a7`	2024-05-27 20:51:26 +08:00
Jianbai Ye	d2c1df7f3d	add openchat-3.6-8B support Former-commit-id: `cff815391f`	2024-05-27 20:42:08 +08:00
hiyouga	df33548b39	update readme Former-commit-id: `5581cb2e4e`	2024-05-27 18:14:02 +08:00
seanzhang-zhichen	9c8d79fbe3	Merge branch 'main' into add_dataset_sample_num Former-commit-id: `27cb51f7f8`	2024-05-24 15:57:47 +08:00
hiyouga	3e729798df	refactor data preprocessing, fix mllm rlhf Former-commit-id: `3a023bca2a`	2024-05-24 04:08:25 +08:00
hiyouga	d3490aceb7	fix paligemma sft requires transformers>=4.41.1 Former-commit-id: `de0e67aff1`	2024-05-24 00:23:40 +08:00

1 2 3 4

161 Commits