LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2025-12-15 19:30:36 +08:00

Author	SHA1	Message	Date
hoshi-hiyouga	7d812ed841	Update loader.py	2024-04-26 03:22:40 +08:00
BUAADreamer	2d4ded535f	modify some style	2024-04-25 21:58:18 +08:00
BUAADreamer	235b411370	modify style	2024-04-25 21:29:50 +08:00
BUAADreamer	1dcabafe72	modify style	2024-04-25 21:15:16 +08:00
BUAADreamer	94ad744941	add some	2024-04-25 21:08:32 +08:00
BUAADreamer	c6dd89918f	merge data part to the text stream	2024-04-25 19:19:59 +08:00
BUAADreamer	838eb87a96	merge model part to the text stream	2024-04-25 08:20:41 +08:00
BUAADreamer	8239907f57	remove error	2024-04-25 01:01:59 +08:00
BUAADreamer	7ffee90799	remove conflicts	2024-04-25 00:34:22 +08:00
BUAADreamer	cfb485eddf	add llava and instructblip	2024-04-25 00:22:43 +08:00
hiyouga	297fb8ead3	support new special token #3420	2024-04-24 23:39:31 +08:00
hiyouga	b1deb0a0b9	support unsloth generate	2024-04-24 04:46:53 +08:00
hiyouga	aa2b79eb23	refactor patcher	2024-04-24 03:02:23 +08:00
hiyouga	707f0b1d5d	fix #3347 #3387	2024-04-24 01:30:16 +08:00
BUAADreamer	4dcb11eab7	add multimodal LLM BLIP-2 and InstructBLIP	2024-04-23 18:45:43 +08:00
hiyouga	f58425ab45	fix mod stuff	2024-04-21 18:11:10 +08:00
hoshi-hiyouga	d0273787be	Merge pull request #3338 from astramind-ai/main Adding Mixture of Depth	2024-04-21 18:05:52 +08:00
hoshi-hiyouga	1fa287fd63	fix #3348	2024-04-20 10:34:09 +08:00
Marco	620add7b9f	Added Mixture of Depths	2024-04-18 20:31:24 +02:00
hiyouga	7dc72fb58c	support unsloth 2024.4	2024-04-16 00:25:03 +08:00
hiyouga	6543f3d449	add codegemma	2024-04-16 00:11:15 +08:00
hiyouga	e0dbac2845	support cohere commandR #3184	2024-04-15 23:26:42 +08:00
hiyouga	cce52351b5	update examples	2024-04-15 22:14:34 +08:00
hiyouga	7f6c2486b8	fix quant infer and qwen2moe	2024-04-09 17:12:59 +08:00
hiyouga	148bda353f	fix resize vocab at inference #3022	2024-04-03 18:14:24 +08:00
hiyouga	b267aeb53f	add moe aux loss control #3085	2024-04-02 14:26:31 +08:00
hiyouga	7afbc85dae	fix #2928	2024-03-24 00:34:54 +08:00
hiyouga	72367307df	improve lora+ impl.	2024-03-13 23:32:51 +08:00
hiyouga	18ffce36b5	fix #2732	2024-03-09 22:37:16 +08:00
hiyouga	bdb496644c	allow non-packing pretraining	2024-03-09 22:21:46 +08:00
hiyouga	e8dd38b7fd	fix #2756 , patch #2746	2024-03-09 02:01:26 +08:00
hoshi-hiyouga	516d0ddc66	Merge pull request #2746 from stephen-nju/main fix deepspeed ppo RuntimeError	2024-03-09 01:37:00 +08:00
hiyouga	10be2f0ecc	fix aqlm version	2024-03-09 00:09:09 +08:00
stephen	cdb7f82869	fix ppo runtime error	2024-03-08 11:48:26 +08:00
hiyouga	f74f804a71	fix #2735	2024-03-07 16:15:53 +08:00
hiyouga	3016e65657	fix version checking	2024-03-06 14:51:51 +08:00
hiyouga	259af60d28	improve aqlm optim	2024-03-05 20:49:50 +08:00
hiyouga	d3d3dac707	optimize aqlm training	2024-03-05 18:35:41 +08:00
hiyouga	4e5fae2fac	fix #2649	2024-03-01 13:02:41 +08:00
hiyouga	c0be617195	fix #2642	2024-02-29 18:32:54 +08:00
hiyouga	fa5ab21ebc	release v0.5.3	2024-02-29 00:34:19 +08:00
hiyouga	cfefacaa37	support DoRA, AWQ, AQLM #2512	2024-02-28 19:53:28 +08:00
hiyouga	7924ffc55d	support llama pro #2338 , add rslora	2024-02-15 02:27:36 +08:00
younesbelkada	0ca0f08162	add v1 hf tags	2024-02-13 05:58:49 +00:00
hiyouga	91d09a01ac	add option to disable version check	2024-02-10 22:31:23 +08:00
hiyouga	38e63bfd28	bump up transformers version	2024-02-04 00:01:16 +08:00
hiyouga	638234ceee	format style	2024-01-20 20:15:56 +08:00
hiyouga	d9f1cae351	support function calling	2024-01-18 09:54:23 +08:00
hiyouga	55021097d5	fix rm server	2024-01-03 15:30:46 +08:00
hiyouga	47da742fc9	fix version	2023-12-29 04:53:36 +08:00

1 2

94 Commits