Zeju Qiu 
							
						 
					 
					
						
						
						
						
							
						
						
							003a2acb1a 
							
						 
					 
					
						
						
							
							[feature] adding orthogononal finetuning (OFT) to llama factory ( #8623 )  
						
						 
						
						... 
						
						
						
						Co-authored-by: Zeju <zqiu@g003.internal.cluster.is.localnet>
Co-authored-by: Zeju <zqiu@login2.is.localnet>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn> 
						
						
					 
					
						2025-08-18 18:22:47 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								XLXW 
							
						 
					 
					
						
						
						
						
							
						
						
							1ada15981a 
							
						 
					 
					
						
						
							
							[feature] add support for dft loss ( #8917 )  
						
						 
						
						
						
						
					 
					
						2025-08-15 23:29:57 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Yaowei Zheng 
							
						 
					 
					
						
						
						
						
							
						
						
							4dfad24902 
							
						 
					 
					
						
						
							
							[model] add gpt oss ( #8826 )  
						
						 
						
						
						
						
					 
					
						2025-08-06 05:56:46 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Butui Hu 
							
						 
					 
					
						
						
						
						
							
						
						
							1a33d65a56 
							
						 
					 
					
						
						
							
							[launcher] Add elastic and fault-tolerant training support ( #8286 )  
						
						 
						
						... 
						
						
						
						Signed-off-by: Butui Hu <hot123tea123@gmail.com> 
						
						
					 
					
						2025-06-05 16:40:03 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							aa9ed4db59 
							
						 
					 
					
						
						
							
							[example] update examples ( #7964 )  
						
						 
						
						
						
						
					 
					
						2025-05-06 17:24:25 +02:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							73198a6645 
							
						 
					 
					
						
						
							
							[misc] fix uv ( #7913 )  
						
						 
						
						
						
						
					 
					
						2025-04-30 07:45:03 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							b07628dea5 
							
						 
					 
					
						
						
							
							[example] add bash usage ( #7794 )  
						
						 
						
						
						
						
					 
					
						2025-04-22 00:25:51 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Juanxi Tian 
							
						 
					 
					
						
						
						
						
							
						
						
							12ada72ed4 
							
						 
					 
					
						
						
							
							[trainer] Add Muon Optimizer ( #7749 )  
						
						 
						
						... 
						
						
						
						Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> 
						
						
					 
					
						2025-04-21 23:38:37 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							416853dd25 
							
						 
					 
					
						
						
							
							[parser] support omegaconf ( #7793 )  
						
						 
						
						
						
						
					 
					
						2025-04-21 23:30:30 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							d222f63cb7 
							
						 
					 
					
						
						
							
							[infer] set env for vllm ascend ( #7745 )  
						
						 
						
						
						
						
					 
					
						2025-04-17 01:08:55 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								leo-pony 
							
						 
					 
					
						
						
						
						
							
						
						
							b9263ff5ac 
							
						 
					 
					
						
						
							
							[infer] support vllm-ascend ( #7739 )  
						
						 
						
						
						
						
					 
					
						2025-04-16 20:06:47 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eric Tang 
							
						 
					 
					
						
						
						
						
							
						
						
							bb8d79bae2 
							
						 
					 
					
						
						
							
							[ray] allow for specifying ray.init kwargs (i.e. runtime_env) ( #7647 )  
						
						 
						
						... 
						
						
						
						* ray init kwargs
* Update trainer_utils.py
* fix ray args
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn> 
						
						
					 
					
						2025-04-10 11:31:05 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							c3c0efbaa0 
							
						 
					 
					
						
						
							
							[misc] fix packing and eval plot ( #7623 )  
						
						 
						
						
						
						
					 
					
						2025-04-07 18:20:57 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							5115dc8c7f 
							
						 
					 
					
						
						
							
							[assets] update readme ( #7612 )  
						
						 
						
						
						
						
					 
					
						2025-04-06 13:58:49 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							2bfcad2394 
							
						 
					 
					
						
						
							
							[model] fix kv cache ( #7564 )  
						
						 
						
						
						
						
					 
					
						2025-04-01 23:07:46 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Qiaolin Yu 
							
						 
					 
					
						
						
						
						
							
						
						
							a44a53ebec 
							
						 
					 
					
						
						
							
							[inference] support sglang backend ( #7278 )  
						
						 
						
						... 
						
						
						
						* Mimic SGLang offline Engine
* Add more tests and args
* Pass all current tests
* Clean Code
* fix sample_params
* clean code
* Fix Stream Chat
* change sglang from engine mode to server mode
* fix
* Fix Review Issues
* Use SGLang Built-In Utilities
* Fix test SGLang
* Some Doc Issue
* fix sglang engine
* add readme
---------
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn> 
						
						
					 
					
						2025-03-15 04:37:58 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							71a1c1321a 
							
						 
					 
					
						
						
							
							[config] update args ( #7231 )  
						
						 
						
						... 
						
						
						
						Former-commit-id: f71a901840811bf560df671ec63a146ff99140c6 
						
						
					 
					
						2025-03-10 23:04:43 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							82a2bac866 
							
						 
					 
					
						
						
							
							[misc] fix ds config ( #7205 )  
						
						 
						
						... 
						
						
						
						Former-commit-id: b478fa1d9de1858075769f86f57126fde92db813 
						
						
					 
					
						2025-03-07 15:21:28 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							7b985f55db 
							
						 
					 
					
						
						
							
							[trainer] update config ( #7174 )  
						
						 
						
						... 
						
						
						
						Former-commit-id: 9f535d0e3c4ee3cd0f1b65218c2eee5d03f43c6f 
						
						
					 
					
						2025-03-05 23:32:54 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							c1d5073bd3 
							
						 
					 
					
						
						
							
							[model] add models ( #7054 )  
						
						 
						
						... 
						
						
						
						* add qwen25vl awq models
* add moonlight
Former-commit-id: ae3be2970fea8a35907202a313ab767381c44916 
						
						
					 
					
						2025-02-24 22:05:13 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							f5cd17881e 
							
						 
					 
					
						
						
							
							[data] update vlm args ( #6976 )  
						
						 
						
						... 
						
						
						
						Former-commit-id: c28e710636a0286d4b8a1d494529b25168a8f3ab 
						
						
					 
					
						2025-02-18 02:12:51 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							c09b648934 
							
						 
					 
					
						
						
							
							[data] add min resolution option ( #6975 )  
						
						 
						
						... 
						
						
						
						Former-commit-id: 76bd9a98a2fb00f1a1d881e6e1364c02fd36d327 
						
						
					 
					
						2025-02-18 01:40:46 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							290057069e 
							
						 
					 
					
						
						
							
							[misc] update readme ( #6917 )  
						
						 
						
						... 
						
						
						
						Former-commit-id: 6bbed1d8c4189fb7bea40230e278c40bb5336fbd 
						
						
					 
					
						2025-02-13 00:58:10 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eric Tang 
							
						 
					 
					
						
						
						
						
							
						
						
							5a221d91f9 
							
						 
					 
					
						
						
							
							[example] fix path to ray example ( #6906 )  
						
						 
						
						... 
						
						
						
						Former-commit-id: e9bee3ef045d85051da04e6ad581a23a9e1a9551 
						
						
					 
					
						2025-02-13 00:29:32 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							86063e27ea 
							
						 
					 
					
						
						
							
							[data] fix ollama template ( #6902 )  
						
						 
						
						... 
						
						
						
						* fix ollama template
* add meta info
* use half precision
Former-commit-id: 1304bbea69d8c8ca57140017515dee7ae2ee6536 
						
						
					 
					
						2025-02-11 22:43:09 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							88eafd865b 
							
						 
					 
					
						
						
							
							[misc] support export ollama modelfile ( #6899 )  
						
						 
						
						... 
						
						
						
						* support export ollama modelfile
* update config
* add system and num ctx
Former-commit-id: 8c2af7466f4015f300b51841db11bcd2505ebf20 
						
						
					 
					
						2025-02-11 19:52:25 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							332f637592 
							
						 
					 
					
						
						
							
							disable valset by default ( #6690 )  
						
						 
						
						... 
						
						
						
						Former-commit-id: a1a94f364e33d1d73852f74eda4fa581e6b16533 
						
						
					 
					
						2025-01-17 21:09:30 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							7638f1070e 
							
						 
					 
					
						
						
							
							[optim] clean apollo ( #6645 )  
						
						 
						
						... 
						
						
						
						* clean apollo code
* update readme
Former-commit-id: 38b8ec4a99189483124b54df9d6bc6b0d318855a 
						
						
					 
					
						2025-01-15 01:42:50 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								zhuHQ 
							
						 
					 
					
						
						
						
						
							
						
						
							c2120432db 
							
						 
					 
					
						
						
							
							[optim] add support to APOLLO ( #6617 )  
						
						 
						
						... 
						
						
						
						Former-commit-id: 5a252e5a458457adbd19da3b68a3897ad2962824 
						
						
					 
					
						2025-01-15 00:24:56 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hoshi-hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							2a05941b14 
							
						 
					 
					
						
						
							
							[inference] fix stop token for object detection ( #6624 )  
						
						 
						
						... 
						
						
						
						* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: 844919fadaa8a61dfae47020971ea80730b2346f 
						
						
					 
					
						2025-01-13 21:34:20 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								codingma 
							
						 
					 
					
						
						
						
						
							
						
						
							11c38b9173 
							
						 
					 
					
						
						
							
							add nf4 qlora support on Ascend NPU ( #6601 )  
						
						 
						
						... 
						
						
						
						* add nf4 qlora support on Ascend NPU
* add transformers version check
* add python>=3.10 requirement description for npu
* tiny fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 7912d1acac5f10dab22145fe729a90c57aad8d85 
						
						
					 
					
						2025-01-13 19:43:36 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							dc65ecdf09 
							
						 
					 
					
						
						
							
							refactor mllm param logic  
						
						 
						
						... 
						
						
						
						Former-commit-id: b895c190945cf5d991cb4e4dea2ae73cc9c8d246 
						
						
					 
					
						2025-01-10 15:45:48 +00:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							944a2aec4d 
							
						 
					 
					
						
						
							
							refactor ray integration, support save ckpt  
						
						 
						
						... 
						
						
						
						Former-commit-id: 2f50b27e608b2092bfceab6c6e84e6631e973ee2 
						
						
					 
					
						2025-01-07 09:39:10 +00:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Eric Tang 
							
						 
					 
					
						
						
						
						
							
						
						
							4f31ad997c 
							
						 
					 
					
						
						
							
							run style check  
						
						 
						
						... 
						
						
						
						Former-commit-id: 5ec33baf5f95df9fa2afe5523c825d3eda8a076b 
						
						
					 
					
						2025-01-07 08:55:44 +00:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Kourosh Hakhamaneshi 
							
						 
					 
					
						
						
						
						
							
						
						
							8683582300 
							
						 
					 
					
						
						
							
							drafting ray integration  
						
						 
						
						... 
						
						
						
						Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
Former-commit-id: 19c12ddae9350f6e25a270fe3372f5b9094cf960 
						
						
					 
					
						2025-01-07 08:55:44 +00:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Yaser Afshar 
							
						 
					 
					
						
						
						
						
							
						
						
							6f1c8dacea 
							
						 
					 
					
						
						
							
							Add missing key to init_kwargs  
						
						 
						
						... 
						
						
						
						Former-commit-id: 03fc4621dad132164596a58d3e8693787b7e1aca 
						
						
					 
					
						2024-12-17 12:34:05 +00:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Yaser Afshar 
							
						 
					 
					
						
						
						
						
							
						
						
							8881237475 
							
						 
					 
					
						
						
							
							Add trust_remote_code parameter and remove True  
						
						 
						
						... 
						
						
						
						- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
  to enhance security
Former-commit-id: 4bf23f406cf5235c16f9f8139850c53354901814 
						
						
					 
					
						2024-12-17 12:25:12 +00:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							8c65548b10 
							
						 
					 
					
						
						
							
							update assets  
						
						 
						
						... 
						
						
						
						Former-commit-id: 7b9bd552b2bf97b72976511094eb51dfde5d1017 
						
						
					 
					
						2024-12-14 17:36:03 +00:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							fb22651faf 
							
						 
					 
					
						
						
							
							fix mrope  
						
						 
						
						... 
						
						
						
						Former-commit-id: 55bee1d333549ca19858b3f5c1b7b86926e5fb09 
						
						
					 
					
						2024-12-12 15:08:17 +00:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							bac2c64f87 
							
						 
					 
					
						
						
							
							support qwen2vl train proj only  
						
						 
						
						... 
						
						
						
						Former-commit-id: 0e949ef03455726e907c6f1039e93ebe480c897a 
						
						
					 
					
						2024-12-05 10:37:42 +00:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							39865d8a1f 
							
						 
					 
					
						
						
							
							update examples  
						
						 
						
						... 
						
						
						
						Former-commit-id: bcb010be7732ae137f156932100ee4d02a93725c 
						
						
					 
					
						2024-12-05 08:48:25 +00:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							c1768cfb14 
							
						 
					 
					
						
						
							
							support batch infer in vllm  
						
						 
						
						... 
						
						
						
						Former-commit-id: 3ef5ed3b9a44eed2f7e3ff221dfc343d0a97c0b5 
						
						
					 
					
						2024-12-04 13:50:00 +00:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							1e6f96508a 
							
						 
					 
					
						
						
							
							add vllm config  
						
						 
						
						... 
						
						
						
						Former-commit-id: 95365f0ce4f362bde7de8b679b54b548d7055bfb 
						
						
					 
					
						2024-11-10 21:28:18 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							ba66ac084f 
							
						 
					 
					
						
						
							
							update tests  
						
						 
						
						... 
						
						
						
						Former-commit-id: 4e92b656e324725048d914946e70867be20032ff 
						
						
					 
					
						2024-11-02 12:41:44 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							9bdba2f6a8 
							
						 
					 
					
						
						
							
							add e2e tests  
						
						 
						
						... 
						
						
						
						Former-commit-id: 0156a37450604641c4f5f9756ad84324698fc88c 
						
						
					 
					
						2024-09-05 21:52:28 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							60cf12727b 
							
						 
					 
					
						
						
							
							add rlhf-v dataset  
						
						 
						
						... 
						
						
						
						Former-commit-id: 3fd18fc34a0c994a738504746abfd5548e002437 
						
						
					 
					
						2024-09-01 22:57:41 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							2f6fc27c8b 
							
						 
					 
					
						
						
							
							remove visual_inputs, fix qlora  
						
						 
						
						... 
						
						
						
						Former-commit-id: be30c01c4f1482520ece770bd54c6a4837c26f0a 
						
						
					 
					
						2024-08-31 00:24:51 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							66a1abac6a 
							
						 
					 
					
						
						
							
							add examples  
						
						 
						
						... 
						
						
						
						Former-commit-id: 169c68921b1b8ac279834b060d9e7d38a56fe1aa 
						
						
					 
					
						2024-08-30 21:43:19 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								hiyouga 
							
						 
					 
					
						
						
						
						
							
						
						
							c62a6ca59d 
							
						 
					 
					
						
						
							
							refactor mm training  
						
						 
						
						... 
						
						
						
						Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a 
						
						
					 
					
						2024-08-30 02:14:31 +08:00  
					
					
						 
						
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								simonJJJ 
							
						 
					 
					
						
						
						
						
							
						
						
							0f3d54d8a0 
							
						 
					 
					
						
						
							
							initial-commit  
						
						 
						
						... 
						
						
						
						Former-commit-id: b6a39847a10b417b09db4b5512dd835e9e4ce928 
						
						
					 
					
						2024-08-28 16:51:35 +08:00