LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-03-11 14:36:00 +08:00

Author	SHA1	Message	Date
hiyouga	653fe70acb	fix constants	2024-09-08 23:52:30 +08:00
hiyouga	54b5c4b819	release v0.9.0	2024-09-08 23:43:35 +08:00
hiyouga	b6681d7198	support vllm 0.6.0	2024-09-08 02:26:20 +08:00
hiyouga	54c6905937	add docstrings, refactor logger	2024-09-08 00:56:56 +08:00
hoshi-hiyouga	1274356263	Merge pull request #5372 from LDLINGLINGLING/main 增加了对minicpm3.0的适配'	2024-09-05 21:35:42 +08:00
liudan	3d3fbaaff9	根据代码规范修改了代码	2024-09-05 20:17:55 +08:00
hiyouga	359ef8bb0e	support Yi-Coder models	2024-09-05 03:12:24 +08:00
hiyouga	8cafc7b055	video datasets	2024-09-05 02:04:17 +08:00
liudan	d7ba97be48	增加了对minicpm3.0的适配'	2024-09-04 23:10:05 +08:00
hiyouga	9967ccb3ae	fix mixed mm inputs and rlhf-v	2024-09-01 20:52:47 +08:00
hiyouga	3382317e32	refactor mm training	2024-08-30 02:14:31 +08:00
hiyouga	ad72f3e065	fix #5295	2024-08-29 20:30:18 +08:00
hiyouga	f6ae4e75dd	tiny fix	2024-08-27 12:49:32 +08:00
hiyouga	c8b4c7fee5	tiny fix	2024-08-20 00:10:52 +08:00
hoshi-hiyouga	d39f4a62d3	Merge pull request #5188 from Zxilly/main fix: report correct device count for intel xpu	2024-08-19 23:51:39 +08:00
Ricardo	384ab8db84	_is_bf16_available judgment supports npu	2024-08-16 02:58:22 +00:00
Zxilly	dc36fcc3de	fix: report correct device count for intel xpu	2024-08-15 08:30:43 +00:00
hiyouga	dc770efb14	add qwen2 math models	2024-08-09 20:20:35 +08:00
hiyouga	b7ca6c8dc1	fix #5048	2024-08-05 23:48:19 +08:00
codingma	dc09d454f2	support gemma-2-2b	2024-08-01 13:45:48 +08:00
hiyouga	1550fe7331	add mistral nemo model	2024-07-24 16:25:53 +08:00
hiyouga	26533c0604	add llama3.1	2024-07-24 16:20:11 +08:00
hiyouga	88c7fc1599	set dev version	2024-07-19 02:01:46 +08:00
hiyouga	bbd5a64423	release v0.8.3	2024-07-19 01:21:18 +08:00
hiyouga	d774b94f12	support batch_eval_metrics, fix #4826	2024-07-17 00:33:00 +08:00
hoshi-hiyouga	f84b007ebb	Update packages.py	2024-07-07 15:48:29 +08:00
Lian Junhong	322663bf90	chore: Update vllm_engine.py to support vllm version >= 0.5.1	2024-07-07 15:08:12 +08:00
hiyouga	53b1002fb7	add codegeex4, internlm2.5	2024-07-06 16:16:47 +08:00
hiyouga	6fd6aa4530	fix packing for eager/sdpa attn	2024-07-04 01:52:43 +08:00
hoshi-hiyouga	87d9b2d005	Merge pull request #4224 from chuan298/main Implement efficient packing without cross-contamination attention	2024-07-04 01:18:54 +08:00
hiyouga	cce7083024	update packing	2024-07-04 01:10:55 +08:00
hiyouga	8a6a7b9c8a	update arg name	2024-07-03 23:23:24 +08:00
hiyouga	c47ab6c072	improve rlhf	2024-07-02 22:23:08 +08:00
hzhaoy	57b7c00430	add TeleChat-1B	2024-07-02 17:49:04 +08:00
hoshi-hiyouga	e8e6af2651	Merge branch 'main' into main	2024-07-01 21:01:09 +08:00
hiyouga	d74244d568	fix #4398 #4592	2024-06-30 21:28:51 +08:00
hiyouga	6f63050e1b	add Gemma2 models	2024-06-28 01:26:50 +08:00
hiyouga	8baf3b22b0	refactor pissa, improve llamaboard	2024-06-28 01:04:24 +08:00
hiyouga	ad144c2265	support HQQ/EETQ #4113	2024-06-27 00:29:42 +08:00
hiyouga	e507e60638	update readme	2024-06-24 18:22:12 +08:00
ancv	770f75dc83	move configure_packing to llamafactory.model.patcher and fix constants	2024-06-21 00:45:06 +07:00
hiyouga	42e69a3c63	set dev version	2024-06-19 21:08:16 +08:00
hiyouga	71327ba85a	release v0.8.2	2024-06-19 20:42:09 +08:00
hiyouga	a233fbc258	add deepseek coder v2 #4346	2024-06-18 22:53:54 +08:00
ancv	238f5c3d99	update packing with sdpa and eager attention mode	2024-06-16 02:25:47 +07:00
hiyouga	38b6b0f52e	tiny fix	2024-06-16 01:06:41 +08:00
hiyouga	1b834f50be	add tests	2024-06-15 19:51:20 +08:00
hiyouga	572d8bbfdd	add minicpm #4227	2024-06-15 17:58:52 +08:00
hiyouga	d87108daa6	add license	2024-06-15 17:54:33 +08:00
hiyouga	2ed8270112	clean code	2024-06-13 01:58:16 +08:00

1 2 3

139 Commits