Commit Graph

152 Commits

Author SHA1 Message Date
hoshi-hiyouga
64a6fb9b50 [model] add QwQ 32b (#7179) 2025-03-06 11:58:36 +08:00
Ze-Yi LIN
8ad03258e1 [trainer] fix swanlab callback (#7176) 2025-03-06 00:33:37 +08:00
hoshi-hiyouga
b4b89b4ff3 [trainer] update config (#7174) 2025-03-05 23:32:54 +08:00
Ze-Yi LIN
891c487503 [webui] display swanlab exp link (#7089)
* webui add swanlab link

* change callback name

* update

---------

Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-02-27 19:40:54 +08:00
Eric Tang
6edd4992d7 [ray] specify ray storage path (#6920) 2025-02-14 21:55:41 +08:00
Billy Cao
11eac71c13 [trainer] fix gen_kwarg to eval during training (#5451)
* Correctly pass gen_kwarg to eval during model runs

* fix

* fix

---------

Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-02-13 02:35:06 +08:00
marko1616
b7fd1e9c00 [trainer] fix llama3.2 vision kto train (#6904) 2025-02-12 19:09:14 +08:00
hoshi-hiyouga
e1a7c1242c [data] fix ollama template (#6902)
* fix ollama template

* add meta info

* use half precision
2025-02-11 22:43:09 +08:00
hoshi-hiyouga
9184a6e0ed [misc] support export ollama modelfile (#6899)
* support export ollama modelfile

* update config

* add system and num ctx
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
5f38bcaba9 [deps] upgrade vllm (#6857) 2025-02-08 15:02:28 +08:00
hoshi-hiyouga
e2dc5b952a [misc] update license year & fix llama pro (#6814)
* fix llamapro script

* change year
2025-02-05 01:53:33 +08:00
hoshi-hiyouga
15357cdad9 [breaking] support transformers 4.48 (#6628) 2025-01-31 01:36:33 +08:00
yinpu
0f45982bac fix: avoid redundant normalization in DPO's SFT loss calculation (#6722) 2025-01-21 13:38:02 +08:00
hoshi-hiyouga
7a04021d04 [optim] clean apollo (#6645)
* clean apollo code

* update readme
2025-01-15 01:42:50 +08:00
zhuHQ
d9189f9f0b [optim] add support to APOLLO (#6617) 2025-01-15 00:24:56 +08:00
hoshi-hiyouga
e3e2c8c689 [inference] fix stop token for object detection (#6624)
* fix stop token

* update minicpm data pipeline

* fix npu qlora examples
2025-01-13 21:34:20 +08:00
hiyouga
47e17dd689 imporve log 2025-01-08 09:56:10 +00:00
hiyouga
c46675d5e5 fix llamaboard with ray 2025-01-07 09:59:24 +00:00
hiyouga
d8cac6f546 refactor ray integration, support save ckpt 2025-01-07 09:39:10 +00:00
Eric Tang
1e8e7be0a5 run style check 2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
163ddb680b drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
2025-01-07 08:55:44 +00:00
hiyouga
870f23d7ea fix #6546 2025-01-07 06:30:44 +00:00
hiyouga
1800f8c72d fix #6499 2025-01-02 11:28:54 +00:00
hiyouga
6f5bb3b8e5 fix #6482 2024-12-30 06:03:07 +00:00
hiyouga
2719867982 fix #6448 2024-12-27 16:54:39 +00:00
hiyouga
5111cac6f8 support report custom args 2024-12-21 21:42:45 +00:00
hoshi-hiyouga
947e22a4a3 Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab
feat: add swanlab for experiment tracking and visualization.
2024-12-21 14:09:33 +08:00
ZeYi Lin
3a7ea2048a fix: by hiyouga suggestion 2024-12-20 16:43:03 +08:00
ZeYi Lin
5f6dafd70e feat: ui improve 2024-12-20 11:03:02 +08:00
ZeYi Lin
d0eb64d5e3 fix: bugs 2024-12-19 21:08:16 +08:00
hiyouga
d4c1fda1ad fix #6391 2024-12-19 12:16:38 +00:00
ZeYi Lin
8c2df41b93 feat: optimize frontend 2024-12-19 19:04:19 +08:00
ZeYi Lin
d5cf87990e feat: swanlab params 2024-12-19 18:47:27 +08:00
hiyouga
c7cedc7569 support disable shuffling 2024-12-19 08:53:21 +00:00
hiyouga
96f8f103e5 add swanlab 2024-12-19 07:12:31 +00:00
hiyouga
eda76de32b support control eos, fix #6345 2024-12-17 10:42:05 +00:00
hiyouga
142191e466 fix #6348 2024-12-17 10:06:46 +00:00
hiyouga
2811814fc4 fix mrope 2024-12-12 15:08:17 +00:00
hiyouga
1324d158f9 support batch infer in vllm 2024-12-04 13:50:00 +00:00
hoshi-hiyouga
bd639a137e Merge pull request #6078 from wtmlon/support-efficient-tokens-calculation
support effective tokens calculation on sft/dpo
2024-11-20 13:43:15 +08:00
Ting
40627c601e code refactor 2024-11-19 20:33:18 +08:00
Ting
f566ecc8d1 update 2024-11-19 19:12:10 +08:00
Ting
ef6e14550d update 2024-11-19 19:10:07 +08:00
Ting
b9f00286d8 support efficient tokens calculation on sft/dpo 2024-11-19 17:15:47 +08:00
hoshi-hiyouga
dc82821872 fix #6050 2024-11-16 16:11:16 +08:00
hiyouga
4270f7dfb9 fix dpo metrics 2024-11-02 20:59:01 +08:00
hiyouga
c38aa29336 support rank0 logger 2024-11-02 18:31:04 +08:00
hiyouga
93d3b8f43f update tests 2024-11-02 12:41:44 +08:00
hiyouga
30567a1487 fix incorrect loss value for vlms 2024-10-30 08:56:46 +00:00
hiyouga
23dbe9a099 fix #5749 2024-10-29 13:02:13 +00:00