Xu-pixel
f547334604
[3rdparty] support swanlab lark notification (#7481)
2025-03-27 01:52:01 +08:00
Kdump
01166841cf
[trainer] fix wsd scheduler (#7304)
...
* [trainer] Warmup_stable_decay supports setting the number of stable and decay steps according to warmup_ratio
* Update trainer_utils.py
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-03-26 15:25:02 +08:00
hoshi-hiyouga
59e12bffe8
[model] add qwen2vl 32b & upgrade peft (#7469)
...
* add qwen2vl 32b
* fix ci
* upgrade peft to 0.15
* fix ci
* fix ci
2025-03-25 12:15:58 +08:00
hoshi-hiyouga
42e090d38b
[trainer] fix vlm loss for transformers 4.49 (#7448)
2025-03-24 10:24:05 +08:00
hoshi-hiyouga
b1b78daf06
[deps] upgrade transformers to 4.50.0 (#7437)
...
* upgrade transformers
* fix hf cache
* fix dpo trainer
2025-03-23 17:44:27 +08:00
Eric Tang
d8a5571be7
[3rdparty] fix redundant process group destroy for ray (#7395)
...
* fix redundant process group destroy for ray
* Update tuner.py
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-03-21 10:56:47 +08:00
hoshi-hiyouga
9ccfb97a2c
[misc] update format (#7277)
2025-03-13 02:53:08 +08:00
hoshi-hiyouga
7c1640ed5f
[misc] upgrade format to py39 (#7256)
2025-03-12 00:08:41 +08:00
hiyouga
f5810a6e47
release v0.9.2
...
Former-commit-id: aaad963593
2025-03-11 14:49:13 +08:00
Ze-Yi LIN
0a43bc1960
[tracking] add swanlab_logdir param (#7219)
...
* feat: add swanlab_logdir param
* fix
Former-commit-id: a1e76af3d9
2025-03-11 00:53:07 +08:00
hoshi-hiyouga
df63f05b47
[data] fix loader (#7207)
...
* fix dataloader
* add test case
* fix type
* fix ci
* fix ci
* fix ci
* disable overwrite cache in ci
Former-commit-id: 8c3f9f6747
2025-03-07 17:20:46 +08:00
hoshi-hiyouga
113cc3d920
[misc] fix cli (#7204)
...
Former-commit-id: bd17223559
2025-03-07 15:01:18 +08:00
hoshi-hiyouga
002f58ef8e
[model] add QwQ 32b (#7179)
...
Former-commit-id: 64a6fb9b50
2025-03-06 11:58:36 +08:00
Ze-Yi LIN
c67d2b9327
[trainer] fix swanlab callback (#7176)
...
Former-commit-id: 8ad03258e1
2025-03-06 00:33:37 +08:00
hoshi-hiyouga
6e58115f98
[trainer] update config (#7174)
...
Former-commit-id: b4b89b4ff3
2025-03-05 23:32:54 +08:00
Ze-Yi LIN
210cdb9557
[webui] display swanlab exp link (#7089)
...
* webui add swanlab link
* change callback name
* update
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 891c487503
2025-02-27 19:40:54 +08:00
Eric Tang
e55ec42d3c
[ray] specify ray storage path (#6920)
...
Former-commit-id: 6edd4992d7
2025-02-14 21:55:41 +08:00
Billy Cao
48173b606c
[trainer] fix gen_kwarg to eval during training (#5451)
...
* Correctly pass gen_kwarg to eval during model runs
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 11eac71c13
2025-02-13 02:35:06 +08:00
marko1616
bae934dea3
[trainer] fix llama3.2 vision kto train (#6904)
...
Former-commit-id: b7fd1e9c00
2025-02-12 19:09:14 +08:00
hoshi-hiyouga
197aa3baf4
[data] fix ollama template (#6902)
...
* fix ollama template
* add meta info
* use half precision
Former-commit-id: e1a7c1242c
2025-02-11 22:43:09 +08:00
hoshi-hiyouga
c6be9e242c
[misc] support export ollama modelfile (#6899)
...
* support export ollama modelfile
* update config
* add system and num ctx
Former-commit-id: 9184a6e0ed
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
ff6658ad27
[deps] upgrade vllm (#6857)
...
Former-commit-id: 5f38bcaba9
2025-02-08 15:02:28 +08:00
hoshi-hiyouga
1fee69f874
[misc] update license year & fix llama pro (#6814)
...
* fix llamapro script
* change year
Former-commit-id: e2dc5b952a
2025-02-05 01:53:33 +08:00
hoshi-hiyouga
f6779b0e0c
[breaking] support transformers 4.48 (#6628)
...
Former-commit-id: 15357cdad9
2025-01-31 01:36:33 +08:00
yinpu
aa7c07caf0
fix: avoid redundant normalization in DPO's SFT loss calculation (#6722)
...
Former-commit-id: 0f45982bac
2025-01-21 13:38:02 +08:00
hoshi-hiyouga
9ef85f8fc4
[optim] clean apollo (#6645)
...
* clean apollo code
* update readme
Former-commit-id: 7a04021d04
2025-01-15 01:42:50 +08:00
zhuHQ
763f9b9df0
[optim] add support to APOLLO (#6617)
...
Former-commit-id: d9189f9f0b
2025-01-15 00:24:56 +08:00
hoshi-hiyouga
d8cba9464f
[inference] fix stop token for object detection (#6624)
...
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689
2025-01-13 21:34:20 +08:00
hiyouga
da542fad18
improve log
...
Former-commit-id: 47e17dd689
2025-01-08 09:56:10 +00:00
hiyouga
0c1ad5f3fb
fix llamaboard with ray
...
Former-commit-id: c46675d5e5
2025-01-07 09:59:24 +00:00
hiyouga
b4174021d6
refactor ray integration, support save ckpt
...
Former-commit-id: d8cac6f546
2025-01-07 09:39:10 +00:00
Eric Tang
bba52e258e
run style check
...
Former-commit-id: 1e8e7be0a5
2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
1217240918
drafting ray integration
...
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
Former-commit-id: 163ddb680b
2025-01-07 08:55:44 +00:00
hiyouga
8c57169eb7
fix #6546
...
Former-commit-id: 870f23d7ea
2025-01-07 06:30:44 +00:00
hiyouga
da8721a70e
fix #6499
...
Former-commit-id: 1800f8c72d
2025-01-02 11:28:54 +00:00
hiyouga
813f5919a3
fix #6482
...
Former-commit-id: 6f5bb3b8e5
2024-12-30 06:03:07 +00:00
hiyouga
3bcb4633ca
fix #6448
...
Former-commit-id: 2719867982
2024-12-27 16:54:39 +00:00
hiyouga
47c2d91933
support report custom args
...
Former-commit-id: 5111cac6f8
2024-12-21 21:42:45 +00:00
hoshi-hiyouga
547f76e56e
Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
...
feat: add swanlab for experiment tracking and visualization.
Former-commit-id: 947e22a4a3
2024-12-21 14:09:33 +08:00
ZeYi Lin
cc703b58f5
fix: by hiyouga suggestion
...
Former-commit-id: 3a7ea2048a
2024-12-20 16:43:03 +08:00
ZeYi Lin
8f786ee938
feat: ui improve
...
Former-commit-id: 5f6dafd70e
2024-12-20 11:03:02 +08:00
ZeYi Lin
dd22454fc5
fix: bugs
...
Former-commit-id: d0eb64d5e3
2024-12-19 21:08:16 +08:00
hiyouga
8524dcaa4a
fix #6391
...
Former-commit-id: d4c1fda1ad
2024-12-19 12:16:38 +00:00
ZeYi Lin
53103f55b6
feat: optimize frontend
...
Former-commit-id: 8c2df41b93
2024-12-19 19:04:19 +08:00
ZeYi Lin
cc5cde734b
feat: swanlab params
...
Former-commit-id: d5cf87990e
2024-12-19 18:47:27 +08:00
hiyouga
95d3c2620b
support disable shuffling
...
Former-commit-id: c7cedc7569
2024-12-19 08:53:21 +00:00
hiyouga
1a48340680
add swanlab
...
Former-commit-id: 96f8f103e5
2024-12-19 07:12:31 +00:00
hiyouga
a94a1eac67
support control eos, fix #6345
...
Former-commit-id: eda76de32b
2024-12-17 10:42:05 +00:00
hiyouga
50ca43c3fb
fix #6348
...
Former-commit-id: 142191e466
2024-12-17 10:06:46 +00:00
hiyouga
6f1e450739
fix mrope
...
Former-commit-id: 2811814fc4
2024-12-12 15:08:17 +00:00