LLaMA-Factory

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-02-27 00:05:58 +08:00

Author	SHA1	Message	Date
hoshi-hiyouga	2bfcad2394	[model] fix kv cache (#7564 )	2025-04-01 23:07:46 +08:00
Qiaolin Yu	a44a53ebec	[inference] support sglang backend (#7278 ) * Mimic SGLang offline Engine * Add more tests and args * Pass all current tests * Clean Code * fix sample_params * clean code * Fix Stream Chat * change sglang from engine mode to server mode * fix * Fix Review Issues * Use SGLang Built-In Utilities * Fix test SGLang * Some Doc Issue * fix sglang engine * add readme --------- Co-authored-by: Jin Pan <jpan236@wisc.edu> Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>	2025-03-15 04:37:58 +08:00
hoshi-hiyouga	71a1c1321a	[config] update args (#7231 ) Former-commit-id: f71a901840811bf560df671ec63a146ff99140c6	2025-03-10 23:04:43 +08:00
hoshi-hiyouga	82a2bac866	[misc] fix ds config (#7205 ) Former-commit-id: b478fa1d9de1858075769f86f57126fde92db813	2025-03-07 15:21:28 +08:00
hoshi-hiyouga	7b985f55db	[trainer] update config (#7174 ) Former-commit-id: 9f535d0e3c4ee3cd0f1b65218c2eee5d03f43c6f	2025-03-05 23:32:54 +08:00
hoshi-hiyouga	c1d5073bd3	[model] add models (#7054 ) * add qwen25vl awq models * add moonlight Former-commit-id: ae3be2970fea8a35907202a313ab767381c44916	2025-02-24 22:05:13 +08:00
hoshi-hiyouga	f5cd17881e	[data] update vlm args (#6976 ) Former-commit-id: c28e710636a0286d4b8a1d494529b25168a8f3ab	2025-02-18 02:12:51 +08:00
hoshi-hiyouga	c09b648934	[data] add min resolution option (#6975 ) Former-commit-id: 76bd9a98a2fb00f1a1d881e6e1364c02fd36d327	2025-02-18 01:40:46 +08:00
hoshi-hiyouga	290057069e	[misc] update readme (#6917 ) Former-commit-id: 6bbed1d8c4189fb7bea40230e278c40bb5336fbd	2025-02-13 00:58:10 +08:00
Eric Tang	5a221d91f9	[example] fix path to ray example (#6906 ) Former-commit-id: e9bee3ef045d85051da04e6ad581a23a9e1a9551	2025-02-13 00:29:32 +08:00
hoshi-hiyouga	86063e27ea	[data] fix ollama template (#6902 ) * fix ollama template * add meta info * use half precision Former-commit-id: 1304bbea69d8c8ca57140017515dee7ae2ee6536	2025-02-11 22:43:09 +08:00
hoshi-hiyouga	88eafd865b	[misc] support export ollama modelfile (#6899 ) * support export ollama modelfile * update config * add system and num ctx Former-commit-id: 8c2af7466f4015f300b51841db11bcd2505ebf20	2025-02-11 19:52:25 +08:00
hoshi-hiyouga	332f637592	disable valset by default (#6690 ) Former-commit-id: a1a94f364e33d1d73852f74eda4fa581e6b16533	2025-01-17 21:09:30 +08:00
hoshi-hiyouga	7638f1070e	[optim] clean apollo (#6645 ) * clean apollo code * update readme Former-commit-id: 38b8ec4a99189483124b54df9d6bc6b0d318855a	2025-01-15 01:42:50 +08:00
zhuHQ	c2120432db	[optim] add support to APOLLO (#6617 ) Former-commit-id: 5a252e5a458457adbd19da3b68a3897ad2962824	2025-01-15 00:24:56 +08:00
hoshi-hiyouga	2a05941b14	[inference] fix stop token for object detection (#6624 ) * fix stop token * update minicpm data pipeline * fix npu qlora examples Former-commit-id: 844919fadaa8a61dfae47020971ea80730b2346f	2025-01-13 21:34:20 +08:00
codingma	11c38b9173	add nf4 qlora support on Ascend NPU (#6601 ) * add nf4 qlora support on Ascend NPU * add transformers version check * add python>=3.10 requirement description for npu * tiny fix --------- Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> Former-commit-id: 7912d1acac5f10dab22145fe729a90c57aad8d85	2025-01-13 19:43:36 +08:00
hiyouga	dc65ecdf09	refactor mllm param logic Former-commit-id: b895c190945cf5d991cb4e4dea2ae73cc9c8d246	2025-01-10 15:45:48 +00:00
hiyouga	944a2aec4d	refactor ray integration, support save ckpt Former-commit-id: 2f50b27e608b2092bfceab6c6e84e6631e973ee2	2025-01-07 09:39:10 +00:00
Eric Tang	4f31ad997c	run style check Former-commit-id: 5ec33baf5f95df9fa2afe5523c825d3eda8a076b	2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi	8683582300	drafting ray integration Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com> Former-commit-id: 19c12ddae9350f6e25a270fe3372f5b9094cf960	2025-01-07 08:55:44 +00:00
Yaser Afshar	6f1c8dacea	Add missing key to init_kwargs Former-commit-id: 03fc4621dad132164596a58d3e8693787b7e1aca	2024-12-17 12:34:05 +00:00
Yaser Afshar	8881237475	Add trust_remote_code parameter and remove True - Introduced a new model parameter `trust_remote_code` - Set the default value of `trust_remote_code` to `False` to enhance security Former-commit-id: 4bf23f406cf5235c16f9f8139850c53354901814	2024-12-17 12:25:12 +00:00
hiyouga	8c65548b10	update assets Former-commit-id: 7b9bd552b2bf97b72976511094eb51dfde5d1017	2024-12-14 17:36:03 +00:00
hiyouga	fb22651faf	fix mrope Former-commit-id: 55bee1d333549ca19858b3f5c1b7b86926e5fb09	2024-12-12 15:08:17 +00:00
hiyouga	bac2c64f87	support qwen2vl train proj only Former-commit-id: 0e949ef03455726e907c6f1039e93ebe480c897a	2024-12-05 10:37:42 +00:00
hiyouga	39865d8a1f	update examples Former-commit-id: bcb010be7732ae137f156932100ee4d02a93725c	2024-12-05 08:48:25 +00:00
hiyouga	c1768cfb14	support batch infer in vllm Former-commit-id: 3ef5ed3b9a44eed2f7e3ff221dfc343d0a97c0b5	2024-12-04 13:50:00 +00:00
hiyouga	1e6f96508a	add vllm config Former-commit-id: 95365f0ce4f362bde7de8b679b54b548d7055bfb	2024-11-10 21:28:18 +08:00
hiyouga	ba66ac084f	update tests Former-commit-id: 4e92b656e324725048d914946e70867be20032ff	2024-11-02 12:41:44 +08:00
hiyouga	9bdba2f6a8	add e2e tests Former-commit-id: 0156a37450604641c4f5f9756ad84324698fc88c	2024-09-05 21:52:28 +08:00
hiyouga	60cf12727b	add rlhf-v dataset Former-commit-id: 3fd18fc34a0c994a738504746abfd5548e002437	2024-09-01 22:57:41 +08:00
hiyouga	2f6fc27c8b	remove visual_inputs, fix qlora Former-commit-id: be30c01c4f1482520ece770bd54c6a4837c26f0a	2024-08-31 00:24:51 +08:00
hiyouga	66a1abac6a	add examples Former-commit-id: 169c68921b1b8ac279834b060d9e7d38a56fe1aa	2024-08-30 21:43:19 +08:00
hiyouga	c62a6ca59d	refactor mm training Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a	2024-08-30 02:14:31 +08:00
simonJJJ	0f3d54d8a0	initial-commit Former-commit-id: b6a39847a10b417b09db4b5512dd835e9e4ce928	2024-08-28 16:51:35 +08:00
hiyouga	47efcdb1dd	update examples Former-commit-id: d5c57c8b7f64afe8061045ec9689abbac45c1175	2024-08-09 20:13:46 +08:00
hiyouga	59cbce1a46	add adam_mini to readme Former-commit-id: d610c6bcf8a8ba6f4236f5d11f79571b83f4fb11	2024-08-09 20:02:03 +08:00
hiyouga	9d1e2c3c1f	update scripts Former-commit-id: dabf5a1dc661a6581474c6a5ec115322d168ed5f	2024-08-09 19:16:23 +08:00
hiyouga	5af32ce705	follow #5115 Former-commit-id: 7d917e03e2df570139bae18227d9c7303a12de2a	2024-08-09 18:03:00 +08:00
codingma	eada49e56b	fix eval_dataset in example Former-commit-id: e1ffc54f7e58419cc8da958a4d3c2697e18d5583	2024-08-07 18:24:19 +08:00
hiyouga	48f0819327	fix #4944 Former-commit-id: 9e8cf3b21a0b12d1413c3c7f3d60399784909242	2024-07-24 16:42:51 +08:00
hoshi-hiyouga	16d655b119	Update llama3_lora_eval.yaml Former-commit-id: 946836f9a3f3385c8d3bc6ab82df6edf13ee571c	2024-07-15 22:55:12 +08:00
codingma	0ea708c226	1. change the task name format 2. delete split param in data_args.py Former-commit-id: 309d30efe24785912ff751fc573677875fc5819e	2024-07-15 09:55:33 +08:00
hiyouga	e4d11a117b	fix up Former-commit-id: 43a56cb331fae899ca35b0c312730d4ab79d0c42	2024-07-15 01:04:56 +08:00
hoshi-hiyouga	bf6ad1fbed	Update llava1_5.yaml Former-commit-id: 68c9670be5a6f9d9ec589f13b43c45aa0ed90033	2024-07-13 20:30:06 +08:00
codingma	bc71380b59	1. fix output_dir in llama3_lora_pretrain.yaml 2. add llava1_5.yaml for inference Former-commit-id: 560928ecf04b7aa351812568d317fcde58bc64d6	2024-07-13 13:16:22 +08:00
hiyouga	74777b4ded	update pissa example Former-commit-id: d01bae6af5f3a619c50247efc8fd83d9f521c6ed	2024-07-06 15:47:32 +08:00
hiyouga	024760f866	update examples Former-commit-id: 66f248b90cfa2b29c73060459b2337b78154c47b	2024-06-28 01:17:07 +08:00
hiyouga	8e5b4bddf4	update examples Former-commit-id: cce238f7d07919b79237bc9ab39265766c20f020	2024-06-27 00:53:33 +08:00

1 2 3

130 Commits