Kingsley
bb1ba31005
[misc] lint mca code ( #9692 )
2025-12-29 11:44:38 +08:00
Copilot
eceec8ab69
[deps] goodbye python 3.9 ( #9677 )
...
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com >
Co-authored-by: hiyouga <16256802+hiyouga@users.noreply.github.com >
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
2025-12-27 02:50:44 +08:00
Yaowei Zheng
55590f5ece
[misc] fix ci with uv ( #9676 )
2025-12-27 01:39:13 +08:00
Xunpeng Xiao
3c17f2722c
[model] Update ernie_vl to adapt new version ( #9665 )
2025-12-26 19:57:49 +08:00
Yaowei Zheng
0894b4f37e
[misc] lint ( #9636 )
2025-12-20 16:19:39 +08:00
mrhaoxx
a769fb94b9
[feat] support ktransformers for dpo ( #9621 )
...
Co-authored-by: poryfly <porykid@gmail.com >
2025-12-18 21:26:25 +08:00
mrhaoxx
964569751f
[kt] refactor ktransformers integration ( #9632 )
2025-12-18 21:26:04 +08:00
浮梦
18c21bce5a
[test] add allreduce test on npu ( #9619 )
...
Co-authored-by: frozenleaves <frozen@Mac.local >
2025-12-16 21:33:30 +08:00
tangefly
4fd94141a4
[model] Add Ministral3 ( #9582 )
...
Co-authored-by: kingsley <kingsleydodonow@gmail.com >
2025-12-10 15:57:24 +08:00
Yaowei Zheng
5d56817e2b
[misc] lint ( #9593 )
...
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2025-12-09 18:00:35 +08:00
Hertz
c1f5f8fff6
[model] support GLM4.6v ( #9586 )
2025-12-09 11:06:42 +08:00
tangefly
739954910a
[deps] Update for Transformers v5 ( #9569 )
2025-12-08 01:13:32 +08:00
Peilin Li
bd30c0003b
[train] fix denominator of ga in ksft loss ( #9409 )
2025-11-05 20:53:23 +08:00
Yaowei Zheng
eaf963f67f
[model] update kt code ( #9406 )
2025-11-05 15:27:22 +08:00
Kingsley
56f45e826f
[train] fix MPO re-weight ( #9405 )
2025-11-04 21:10:41 +08:00
Peilin Li
934b3084ee
[train] KTransformers SFT as backend engine for LLaMA-Factory ( #9400 )
...
Co-authored-by: jimmy128 <jimmy128@noreply.gitcode.com >
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn >
2025-11-04 15:54:12 +08:00
Yaowei Zheng
3ae15da9c0
[misc] lint code ( #9395 )
2025-11-03 22:08:59 +08:00
Kingsley
13170577b2
[feat] support megatron-LM training by mcore_adapter ( #9237 )
...
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn >
2025-10-26 16:21:30 +08:00
Yaowei Zheng
d9d67ba62d
[misc] fix import error ( #9299 )
2025-10-17 17:46:27 +08:00
Ben Feuer
1c44b60e3e
[feat] fp8 training ( #8960 )
...
Co-authored-by: Benjamin Feuer <penfever@gmail.com >
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn >
2025-10-01 14:32:53 +08:00
Yaowei Zheng
52488ac974
[deps] upgrade transformers to 4.56.1 ( #9128 )
2025-09-14 02:26:39 +08:00
Yaowei Zheng
2c31279316
[assets] update wechat ( #8962 )
2025-08-19 02:55:09 +08:00
Zeju Qiu
003a2acb1a
[feature] adding orthogononal finetuning (OFT) to llama factory ( #8623 )
...
Co-authored-by: Zeju <zqiu@g003.internal.cluster.is.localnet >
Co-authored-by: Zeju <zqiu@login2.is.localnet >
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn >
2025-08-18 18:22:47 +08:00
XLXW
1ada15981a
[feature] add support for dft loss ( #8917 )
2025-08-15 23:29:57 +08:00
Kingsley
936f4fd78e
[feature] Support MPO ( #8930 )
2025-08-15 15:09:59 +08:00
golangboy
ef507ae8e0
[file] Resolve file lock issue when deleting safetensors on Windows ( #8839 )
2025-08-08 14:59:54 +08:00
Yaowei Zheng
2c26ce6ac4
Merge commit from fork
2025-06-26 13:55:42 +08:00
Yaowei Zheng
9a2d1dec62
[assets] update wechat ( #8385 )
2025-06-16 18:23:22 +08:00
Aman Gupta
8e4ac78607
[trainer] Add LD-DPO objective ( #8362 )
2025-06-12 16:10:38 +08:00
Ze-Yi LIN
c4e51d40e0
[tracking] swanlab add llamafactory tag ( #8258 )
2025-06-03 18:42:29 +08:00
hoshi-hiyouga
9ae17cd173
[deps] update to transformers 4.52 ( #8125 )
2025-05-21 05:16:18 +08:00
hoshi-hiyouga
beae231af6
[doc] add no build isolation ( #8103 )
2025-05-19 19:25:13 +08:00
Ma, Xiaochen
a0b4b91577
[trainer] fix KeyError at end of pretrain ( #8099 )
2025-05-19 18:01:26 +08:00
Eric Tang
ef03832cd4
[ray] add storage filesystem to ray config ( #7854 )
2025-04-27 22:12:40 +08:00
hoshi-hiyouga
fddcd43c88
[trainer] support early stop ( #7797 )
2025-04-22 01:59:33 +08:00
hoshi-hiyouga
b07628dea5
[example] add bash usage ( #7794 )
2025-04-22 00:25:51 +08:00
Juanxi Tian
12ada72ed4
[trainer] Add Muon Optimizer ( #7749 )
...
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
2025-04-21 23:38:37 +08:00
hoshi-hiyouga
416853dd25
[parser] support omegaconf ( #7793 )
2025-04-21 23:30:30 +08:00
hoshi-hiyouga
39169986ef
[trainer] fix pt loss ( #7748 )
...
* fix pt loss
* robust
* fix
* test
2025-04-17 03:15:35 +08:00
hoshi-hiyouga
86ebb219d6
[breaking] bump transformers to 4.45.0 & improve ci ( #7746 )
...
* update ci
* fix
* fix
* fix
* fix
* fix
2025-04-17 02:36:48 +08:00
Eric Tang
bb8d79bae2
[ray] allow for specifying ray.init kwargs (i.e. runtime_env) ( #7647 )
...
* ray init kwargs
* Update trainer_utils.py
* fix ray args
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
2025-04-10 11:31:05 +08:00
hoshi-hiyouga
1abd71b551
[assets] update readme ( #7644 )
2025-04-09 01:06:06 +08:00
Shawn Tao
acb09fa3a3
[trainer] fix key error ( #7635 )
2025-04-08 18:39:50 +08:00
hoshi-hiyouga
c3c0efbaa0
[misc] fix packing and eval plot ( #7623 )
2025-04-07 18:20:57 +08:00
hoshi-hiyouga
831e7f1cfd
[model] add llama4 ( #7611 )
2025-04-06 13:42:31 +08:00
gechengze
7b9deb9410
[trainer] fix batch processing in PPO trainer ( #7576 )
2025-04-02 21:17:48 +08:00
Xu-pixel
b578a7d5b6
[3rdparty] support swanlab lark notification ( #7481 )
2025-03-27 01:52:01 +08:00
Kdump
24afceddb7
[trainer] fix wsd scheduler ( #7304 )
...
* [trainer] Warmup_stable_decay supports setting the number of stable and decay steps according to the warmup_ratio ratio
* Update trainer_utils.py
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
2025-03-26 15:25:02 +08:00
hoshi-hiyouga
0583d06676
[model] add qwen2vl 32b & upgrade peft ( #7469 )
...
* add qwen2vl 32b
* fix ci
* upgrade peft to 0.15
* fix ci
* fix ci
2025-03-25 12:15:58 +08:00
hoshi-hiyouga
7203365b80
[trainer] fix vlm loss for transformers 4.49 ( #7448 )
2025-03-24 10:24:05 +08:00