Commit Graph

169 Commits

Author SHA1 Message Date
hoshi-hiyouga
39876b85fc [assets] update readme (#7644) 2025-04-09 01:06:06 +08:00
Shawn Tao
8f5f4cc559 [trainer] fix key error (#7635) 2025-04-08 18:39:50 +08:00
hoshi-hiyouga
5817cda37e [misc] fix packing and eval plot (#7623) 2025-04-07 18:20:57 +08:00
hoshi-hiyouga
6c200fd218 [model] add llama4 (#7611) 2025-04-06 13:42:31 +08:00
gechengze
11997593be [trainer] fix batch processing in PPO trainer (#7576) 2025-04-02 21:17:48 +08:00
Xu-pixel
f547334604 [3rdparty] support swanlab lark notification (#7481) 2025-03-27 01:52:01 +08:00
Kdump
01166841cf [trainer] fix wsd scheduler (#7304)
* [trainer] Warmup_stable_decay supports setting the number of stable and decay steps according to the warmup_ratio ratio

* Update trainer_utils.py

---------

Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-03-26 15:25:02 +08:00
hoshi-hiyouga
59e12bffe8 [model] add qwen2vl 32b & upgrade peft (#7469)
* add qwen2vl 32b

* fix ci

* upgrade peft to 0.15

* fix ci

* fix ci
2025-03-25 12:15:58 +08:00
hoshi-hiyouga
42e090d38b [trainer] fix vlm loss for transformers 4.49 (#7448) 2025-03-24 10:24:05 +08:00
hoshi-hiyouga
b1b78daf06 [deps] upgrade transformers to 4.50.0 (#7437)
* upgrade transformers

* fix hf cache

* fix dpo trainer
2025-03-23 17:44:27 +08:00
Eric Tang
d8a5571be7 [3rdparty] fix redundant process group destroy for ray (#7395)
* fix redundant process group destroy for ray

* Update tuner.py

---------

Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-03-21 10:56:47 +08:00
hoshi-hiyouga
9ccfb97a2c [misc] update format (#7277) 2025-03-13 02:53:08 +08:00
hoshi-hiyouga
7c1640ed5f [misc] upgrade format to py39 (#7256) 2025-03-12 00:08:41 +08:00
hiyouga
f5810a6e47 release v0.9.2
Former-commit-id: aaad963593
2025-03-11 14:49:13 +08:00
Ze-Yi LIN
0a43bc1960 [tracking] add swanlab_logdir param (#7219)
* feat: add swanlab_logdir param

* fix

Former-commit-id: a1e76af3d9
2025-03-11 00:53:07 +08:00
hoshi-hiyouga
df63f05b47 [data] fix loader (#7207)
* fix dataloader

* add test case

* fix type

* fix ci

* fix ci

* fix ci

* disable overwrite cache in ci

Former-commit-id: 8c3f9f6747
2025-03-07 17:20:46 +08:00
hoshi-hiyouga
113cc3d920 [misc] fix cli (#7204)
Former-commit-id: bd17223559
2025-03-07 15:01:18 +08:00
hoshi-hiyouga
002f58ef8e [model] add QwQ 32b (#7179)
Former-commit-id: 64a6fb9b50
2025-03-06 11:58:36 +08:00
Ze-Yi LIN
c67d2b9327 [trainer] fix swanlab callback (#7176)
Former-commit-id: 8ad03258e1
2025-03-06 00:33:37 +08:00
hoshi-hiyouga
6e58115f98 [trainer] update config (#7174)
Former-commit-id: b4b89b4ff3
2025-03-05 23:32:54 +08:00
Ze-Yi LIN
210cdb9557 [webui] display swanlab exp link (#7089)
* webui add swanlab link

* change callback name

* update

---------

Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 891c487503
2025-02-27 19:40:54 +08:00
Eric Tang
e55ec42d3c [ray] specify ray storage path (#6920)
Former-commit-id: 6edd4992d7
2025-02-14 21:55:41 +08:00
Billy Cao
48173b606c [trainer] fix gen_kwarg to eval during training (#5451)
* Correctly pass gen_kwarg to eval during model runs

* fix

* fix

---------

Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 11eac71c13
2025-02-13 02:35:06 +08:00
marko1616
bae934dea3 [trainer] fix llama3.2 vision kto train (#6904)
Former-commit-id: b7fd1e9c00
2025-02-12 19:09:14 +08:00
hoshi-hiyouga
197aa3baf4 [data] fix ollama template (#6902)
* fix ollama template

* add meta info

* use half precision

Former-commit-id: e1a7c1242c
2025-02-11 22:43:09 +08:00
hoshi-hiyouga
c6be9e242c [misc] support export ollama modelfile (#6899)
* support export ollama modelfile

* update config

* add system and num ctx

Former-commit-id: 9184a6e0ed
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
ff6658ad27 [deps] upgrade vllm (#6857)
Former-commit-id: 5f38bcaba9
2025-02-08 15:02:28 +08:00
hoshi-hiyouga
1fee69f874 [misc] update license year & fix llama pro (#6814)
* fix llamapro script

* change year

Former-commit-id: e2dc5b952a
2025-02-05 01:53:33 +08:00
hoshi-hiyouga
f6779b0e0c [breaking] support transformers 4.48 (#6628)
Former-commit-id: 15357cdad9
2025-01-31 01:36:33 +08:00
yinpu
aa7c07caf0 fix: avoid redundant normalization in DPO's SFT loss calculation (#6722)
Former-commit-id: 0f45982bac
2025-01-21 13:38:02 +08:00
hoshi-hiyouga
9ef85f8fc4 [optim] clean apollo (#6645)
* clean apollo code

* update readme

Former-commit-id: 7a04021d04
2025-01-15 01:42:50 +08:00
zhuHQ
763f9b9df0 [optim] add support to APOLLO (#6617)
Former-commit-id: d9189f9f0b
2025-01-15 00:24:56 +08:00
hoshi-hiyouga
d8cba9464f [inference] fix stop token for object detection (#6624)
* fix stop token

* update minicpm data pipeline

* fix npu qlora examples

Former-commit-id: e3e2c8c689
2025-01-13 21:34:20 +08:00
hiyouga
da542fad18 imporve log
Former-commit-id: 47e17dd689
2025-01-08 09:56:10 +00:00
hiyouga
0c1ad5f3fb fix llamaboard with ray
Former-commit-id: c46675d5e5
2025-01-07 09:59:24 +00:00
hiyouga
b4174021d6 refactor ray integration, support save ckpt
Former-commit-id: d8cac6f546
2025-01-07 09:39:10 +00:00
Eric Tang
bba52e258e run style check
Former-commit-id: 1e8e7be0a5
2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
1217240918 drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

Former-commit-id: 163ddb680b
2025-01-07 08:55:44 +00:00
hiyouga
8c57169eb7 fix #6546
Former-commit-id: 870f23d7ea
2025-01-07 06:30:44 +00:00
hiyouga
da8721a70e fix #6499
Former-commit-id: 1800f8c72d
2025-01-02 11:28:54 +00:00
hiyouga
813f5919a3 fix #6482
Former-commit-id: 6f5bb3b8e5
2024-12-30 06:03:07 +00:00
hiyouga
3bcb4633ca fix #6448
Former-commit-id: 2719867982
2024-12-27 16:54:39 +00:00
hiyouga
47c2d91933 support report custom args
Former-commit-id: 5111cac6f8
2024-12-21 21:42:45 +00:00
hoshi-hiyouga
547f76e56e Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
Former-commit-id: 947e22a4a3
2024-12-21 14:09:33 +08:00
ZeYi Lin
cc703b58f5 fix: by hiyouga suggestion
Former-commit-id: 3a7ea2048a
2024-12-20 16:43:03 +08:00
ZeYi Lin
8f786ee938 feat: ui improve
Former-commit-id: 5f6dafd70e
2024-12-20 11:03:02 +08:00
ZeYi Lin
dd22454fc5 fix: bugs
Former-commit-id: d0eb64d5e3
2024-12-19 21:08:16 +08:00
hiyouga
8524dcaa4a fix #6391
Former-commit-id: d4c1fda1ad
2024-12-19 12:16:38 +00:00
ZeYi Lin
53103f55b6 feat: optimize frontend
Former-commit-id: 8c2df41b93
2024-12-19 19:04:19 +08:00
ZeYi Lin
cc5cde734b feat: swanlab params
Former-commit-id: d5cf87990e
2024-12-19 18:47:27 +08:00