Kingsley
86af92ed56
[feature] Support MPO ( #8930 )
2025-08-15 15:09:59 +08:00
golangboy
4e52325514
[file] Resolve file lock issue when deleting safetensors on Windows ( #8839 )
2025-08-08 14:59:54 +08:00
Yaowei Zheng
75e6de5425
Merge commit from fork
2025-06-26 13:55:42 +08:00
Yaowei Zheng
cd5420030c
[assets] update wechat ( #8385 )
2025-06-16 18:23:22 +08:00
Aman Gupta
61a1e4f809
[trainer] Add LD-DPO objective ( #8362 )
2025-06-12 16:10:38 +08:00
Ze-Yi LIN
7b8f560e34
[tracking] swanlab add llamafactory tag ( #8258 )
2025-06-03 18:42:29 +08:00
hoshi-hiyouga
abb581026f
[deps] update to transformers 4.52 ( #8125 )
2025-05-21 05:16:18 +08:00
hoshi-hiyouga
8a18a28624
[doc] add no build isolation ( #8103 )
2025-05-19 19:25:13 +08:00
Ma, Xiaochen
7a22989d17
[trainer] fix KeyError at end of pretrain ( #8099 )
2025-05-19 18:01:26 +08:00
Eric Tang
4ec560df1c
[ray] add storage filesystem to ray config ( #7854 )
2025-04-27 22:12:40 +08:00
hoshi-hiyouga
bc969b6ac5
[trainer] support early stop ( #7797 )
2025-04-22 01:59:33 +08:00
hoshi-hiyouga
cea9071ed1
[example] add bash usage ( #7794 )
2025-04-22 00:25:51 +08:00
Juanxi Tian
5a02c5afc2
[trainer] Add Muon Optimizer ( #7749 )
...
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
2025-04-21 23:38:37 +08:00
hoshi-hiyouga
3fc56b8499
[parser] support omegaconf ( #7793 )
2025-04-21 23:30:30 +08:00
hoshi-hiyouga
8208cbf1dc
[trainer] fix pt loss ( #7748 )
...
* fix pt loss
* robust
* fix
* test
2025-04-17 03:15:35 +08:00
hoshi-hiyouga
a0818eae58
[breaking] bump transformers to 4.45.0 & improve ci ( #7746 )
...
* update ci
* fix
* fix
* fix
* fix
* fix
2025-04-17 02:36:48 +08:00
Eric Tang
5cc0d6a8f0
[ray] allow for specifying ray.init kwargs (i.e. runtime_env) ( #7647 )
...
* ray init kwargs
* Update trainer_utils.py
* fix ray args
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
2025-04-10 11:31:05 +08:00
hoshi-hiyouga
458b6b0aef
[assets] update readme ( #7644 )
2025-04-09 01:06:06 +08:00
Shawn Tao
85f95a2883
[trainer] fix key error ( #7635 )
2025-04-08 18:39:50 +08:00
hoshi-hiyouga
fb46193364
[misc] fix packing and eval plot ( #7623 )
2025-04-07 18:20:57 +08:00
hoshi-hiyouga
40fb24916f
[model] add llama4 ( #7611 )
2025-04-06 13:42:31 +08:00
gechengze
a47370b85f
[trainer] fix batch processing in PPO trainer ( #7576 )
2025-04-02 21:17:48 +08:00
Xu-pixel
2a952305f3
[3rdparty] support swanlab lark notification ( #7481 )
2025-03-27 01:52:01 +08:00
Kdump
2c1d0b7a83
[trainer] fix wsd scheduler ( #7304 )
...
* [trainer] Warmup_stable_decay supports setting the number of stable and decay steps according to the warmup_ratio ratio
* Update trainer_utils.py
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
2025-03-26 15:25:02 +08:00
hoshi-hiyouga
cb42e2c4de
[model] add qwen2vl 32b & upgrade peft ( #7469 )
...
* add qwen2vl 32b
* fix ci
* upgrade peft to 0.15
* fix ci
* fix ci
2025-03-25 12:15:58 +08:00
hoshi-hiyouga
180019b376
[trainer] fix vlm loss for transformers 4.49 ( #7448 )
2025-03-24 10:24:05 +08:00
hoshi-hiyouga
1a7c872c14
[deps] upgrade transformers to 4.50.0 ( #7437 )
...
* upgrade transformers
* fix hf cache
* fix dpo trainer
2025-03-23 17:44:27 +08:00
Eric Tang
8f09c0bf96
[3rdparty] fix redundant process group destroy for ray ( #7395 )
...
* fix redundant process group destroy for ray
* Update tuner.py
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
2025-03-21 10:56:47 +08:00
hoshi-hiyouga
1b1964714e
[misc] update format ( #7277 )
2025-03-13 02:53:08 +08:00
hoshi-hiyouga
efa86e730c
[misc] upgrade format to py39 ( #7256 )
2025-03-12 00:08:41 +08:00
hiyouga
3722d04db1
release v0.9.2
...
Former-commit-id: e7ed1782d4a006400de6fc0f864abd01f7fadeea
2025-03-11 14:49:13 +08:00
Ze-Yi LIN
1358ad9afd
[tracking] add swanlab_logdir param ( #7219 )
...
* feat: add swanlab_logdir param
* fix
Former-commit-id: 9215ad488b6ac6cd57fe8fa4acdacceb63f68ca5
2025-03-11 00:53:07 +08:00
hoshi-hiyouga
8c7917d1a2
[data] fix loader ( #7207 )
...
* fix dataloader
* add test case
* fix type
* fix ci
* fix ci
* fix ci
* disable overwrite cache in ci
Former-commit-id: e84af0e140b1aafd1a6d6fe185a8e41c8fc5f831
2025-03-07 17:20:46 +08:00
hoshi-hiyouga
63e4b14565
[misc] fix cli ( #7204 )
...
Former-commit-id: 999f57133ca163c7108d2d5ee8194eca9b2109b4
2025-03-07 15:01:18 +08:00
hoshi-hiyouga
25546b9afe
[model] add QwQ 32b ( #7179 )
...
Former-commit-id: 8897e48b8cd55407812453ddd4ff98ac7bdc4e91
2025-03-06 11:58:36 +08:00
Ze-Yi LIN
754dbb8b07
[trainer] fix swanlab callback ( #7176 )
...
Former-commit-id: 6d9acf4bd30db24499118aee16bd19cb19ba9e3d
2025-03-06 00:33:37 +08:00
hoshi-hiyouga
54bcc37f55
[trainer] update config ( #7174 )
...
Former-commit-id: 9f535d0e3c4ee3cd0f1b65218c2eee5d03f43c6f
2025-03-05 23:32:54 +08:00
Ze-Yi LIN
b531afb74e
[webui] display swanlab exp link ( #7089 )
...
* webui add swanlab link
* change callback name
* update
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 27a4b93871c63b839c92940766bd7e0177972c9b
2025-02-27 19:40:54 +08:00
Eric Tang
413aa5944a
[ray] specify ray storage path ( #6920 )
...
Former-commit-id: 4be6b66b1eaa79955e936ce2b747a8837ecd1e49
2025-02-14 21:55:41 +08:00
Billy Cao
680648098b
[trainer] fix gen_kwarg to eval during training ( #5451 )
...
* Correctly pass gen_kwarg to eval during model runs
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 845d16122496311e08263610a6a922f82604de7b
2025-02-13 02:35:06 +08:00
marko1616
a23e16ae89
[trainer] fix llama3.2 vision kto train ( #6904 )
...
Former-commit-id: 1563e89adc8988fc6e4250634a3f1e385979b0e5
2025-02-12 19:09:14 +08:00
hoshi-hiyouga
c5649d7149
[data] fix ollama template ( #6902 )
...
* fix ollama template
* add meta info
* use half precision
Former-commit-id: 1304bbea69d8c8ca57140017515dee7ae2ee6536
2025-02-11 22:43:09 +08:00
hoshi-hiyouga
ca5cd8276c
[misc] support export ollama modelfile ( #6899 )
...
* support export ollama modelfile
* update config
* add system and num ctx
Former-commit-id: 8c2af7466f4015f300b51841db11bcd2505ebf20
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
c322512037
[deps] upgrade vllm ( #6857 )
...
Former-commit-id: 4bd50f65a3d62528768561019fda2723d045c7fd
2025-02-08 15:02:28 +08:00
hoshi-hiyouga
40b6e9045d
[misc] update license year & fix llama pro ( #6814 )
...
* fix llamapro script
* change year
Former-commit-id: d9ae594178796994d400a5f207d6499712816f89
2025-02-05 01:53:33 +08:00
hoshi-hiyouga
46068b3324
[breaking] support transformers 4.48 ( #6628 )
...
Former-commit-id: f154ab175c513a4d7bb866bf2cffc34b77b50508
2025-01-31 01:36:33 +08:00
yinpu
5062b099f7
fix: avoid redundant normalization in DPO's SFT loss calculation ( #6722 )
...
Former-commit-id: 971a8ccbdacf130763d40c7ef82a711b2fc1292f
2025-01-21 13:38:02 +08:00
hoshi-hiyouga
33d420bbcc
[optim] clean apollo ( #6645 )
...
* clean apollo code
* update readme
Former-commit-id: 38b8ec4a99189483124b54df9d6bc6b0d318855a
2025-01-15 01:42:50 +08:00
zhuHQ
9b29a431db
[optim] add support to APOLLO ( #6617 )
...
Former-commit-id: 5a252e5a458457adbd19da3b68a3897ad2962824
2025-01-15 00:24:56 +08:00
hoshi-hiyouga
7ab274eb67
[inference] fix stop token for object detection ( #6624 )
...
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: 844919fadaa8a61dfae47020971ea80730b2346f
2025-01-13 21:34:20 +08:00