89 Commits

Author SHA1 Message Date
Yaowei Zheng
5817583630
[deps] bump transformers to 4.49.0 (#8564) 2025-07-07 20:31:50 +08:00
Ze-Yi LIN
8e7727f4ee
[tracking] fix swanlab hparams (#8532)
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
2025-07-02 22:08:44 +08:00
Kingsley
4465e4347e
[assets] update readme (#8519) 2025-07-02 15:38:38 +08:00
Injae Ryou
ac6c93df1f
[parser] update config loading to use OmegaConf #7793 (#8505) 2025-07-01 21:05:13 +08:00
Yaowei Zheng
0e1fea71d2
[assets] update readme (#8396) 2025-06-17 16:15:20 +08:00
Yaowei Zheng
af2f75e688
[data] fix qwen2vl pos ids (#8387) 2025-06-17 00:48:54 +08:00
hoshi-hiyouga
f3fd67a9bb
[model] switch to gptqmodel (#8108) 2025-05-19 22:25:40 +08:00
hoshi-hiyouga
a8430f4244
[misc] fix uv (#7913) 2025-04-30 07:45:03 +08:00
hoshi-hiyouga
a62cba3d05
[example] add bash usage (#7794) 2025-04-22 00:25:51 +08:00
Juanxi Tian
d128382d3c
[trainer] Add Muon Optimizer (#7749)
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
2025-04-21 23:38:37 +08:00
hoshi-hiyouga
278df4308d
[parser] support omegaconf (#7793) 2025-04-21 23:30:30 +08:00
hoshi-hiyouga
4831552856
[infer] set env for vllm ascend (#7745) 2025-04-17 01:08:55 +08:00
hoshi-hiyouga
0fe5631f9b
[deps] upgrade vllm (#7728) 2025-04-15 14:57:40 +08:00
hoshi-hiyouga
3ef36d0057
[misc] upgrade cli (#7714) 2025-04-14 15:41:22 +08:00
hoshi-hiyouga
5817cda37e
[misc] fix packing and eval plot (#7623) 2025-04-07 18:20:57 +08:00
hoshi-hiyouga
aaf2e6ba2a
[model] fix kv cache (#7564) 2025-04-01 23:07:46 +08:00
hoshi-hiyouga
dfbe1391e9
[deps] upgrade vllm to 0.8 (#7436) 2025-03-23 14:32:22 +08:00
Qiaolin Yu
30038d9ce7
[inference] support sglang backend (#7278)
* Mimic SGLang offline Engine

* Add more tests and args

* Pass all current tests

* Clean Code

* fix sample_params

* clean code

* Fix Stream Chat

* change sglang from engine mode to server mode

* fix

* Fix Review Issues

* Use SGLang Built-In Utilities

* Fix test SGLang

* Some Doc Issue

* fix sglang engine

* add readme

---------

Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-03-15 04:37:58 +08:00
hoshi-hiyouga
9ccfb97a2c
[misc] update format (#7277) 2025-03-13 02:53:08 +08:00
hoshi-hiyouga
7c1640ed5f
[misc] upgrade format to py39 (#7256) 2025-03-12 00:08:41 +08:00
hoshi-hiyouga
4e68828e46 [config] fix export max len (#7230)
Former-commit-id: 728c2f68190129e88ac375b453c60ed0d6bebbd8
2025-03-10 16:46:08 +08:00
hoshi-hiyouga
113cc3d920 [misc] fix cli (#7204)
Former-commit-id: bd1722355944a3618f44a4ac3e8177c34b97dc75
2025-03-07 15:01:18 +08:00
hoshi-hiyouga
e7556b591e [deps] upgrade vllm (#7183)
Former-commit-id: d739fddb10f51f422c12b1e7d5d7208309bf6c91
2025-03-06 15:25:08 +08:00
hoshi-hiyouga
036fb0d561 [misc] fix grad ckpt func (#6916)
Former-commit-id: e34c3c06da706f80c74c20800f19110e9ad6b82a
2025-02-13 00:17:18 +08:00
hoshi-hiyouga
2e2f6bea07 [data] feat: auto template (#6905)
* support auto template

* add unittest

Former-commit-id: 2f8b6847f5e199d770e91346dfe205c4b9f1fbb7
2025-02-12 00:22:53 +08:00
hoshi-hiyouga
ff6658ad27 [deps] upgrade vllm (#6857)
Former-commit-id: 5f38bcaba921dbdee27b4be4709fcec06fa37c9e
2025-02-08 15:02:28 +08:00
hoshi-hiyouga
f70208e1c0 [misc] allow extra args (#6831)
Former-commit-id: 74ade3a176cad753971aaad681fea6ff8df40914
2025-02-06 12:38:08 +08:00
hoshi-hiyouga
9ef85f8fc4 [optim] clean apollo (#6645)
* clean apollo code

* update readme

Former-commit-id: 7a04021d0461caea2c7b82169839340b7f51f463
2025-01-15 01:42:50 +08:00
zhuHQ
763f9b9df0 [optim] add support to APOLLO (#6617)
Former-commit-id: d9189f9f0b23ff6929044919208e0e813ca95b1c
2025-01-15 00:24:56 +08:00
hoshi-hiyouga
5e699458e5 pin vllm version to 0.6.5 (#6629)
Former-commit-id: 1c7663d3049e00a9148c3e3c58204deca7a08c8d
2025-01-14 02:44:02 +08:00
hoshi-hiyouga
b777fed171 Merge pull request #6564 from stephen-nju/fix_ray
Fix ray

Former-commit-id: 6b34b69fa688c4622489d3d5f33d847fb6b95528
2025-01-08 18:14:18 +08:00
zhubin
014a7ea042 fix –get ray args when args not a dict
Former-commit-id: 9c4c84828b77acf48caf60726e4e7ef3e972118d
2025-01-08 10:06:02 +00:00
hiyouga
da542fad18 imporve log
Former-commit-id: 47e17dd689840ca9b3c5f34448e5f80265336cca
2025-01-08 09:56:10 +00:00
hiyouga
b4174021d6 refactor ray integration, support save ckpt
Former-commit-id: d8cac6f54663e6cffeddf2c65e3da454e7b86a75
2025-01-07 09:39:10 +00:00
Eric Tang
bba52e258e run style check
Former-commit-id: 1e8e7be0a535e55888f58bbe2c38bc1c382e9012
2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
1217240918 drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

Former-commit-id: 163ddb680b6f84a4424a887a3b8a5d668044e87c
2025-01-07 08:55:44 +00:00
hiyouga
813f5919a3 fix #6482
Former-commit-id: 6f5bb3b8e5b6eb7fdfd7b0ca8eba789ab741a7b6
2024-12-30 06:03:07 +00:00
hiyouga
235cdcacee support batch infer in vllm
Former-commit-id: 1324d158f954d777f1fbf09f46149c372704b388
2024-12-04 13:50:00 +00:00
hiyouga
0d18cca0db add vllm config
Former-commit-id: 58ab4579dc81a1dcea2bf5938ba3f3116cecfc76
2024-11-10 21:28:18 +08:00
hiyouga
e83cb17f97 support rank0 logger
Former-commit-id: c38aa29336f286266553da4909a7267d7ef21f37
2024-11-02 18:31:04 +08:00
hiyouga
0d8aa6e6ef use pre-commit
Former-commit-id: 21db8ed2f4a0eba203754a92ce0741538e8ee709
2024-10-29 09:07:46 +00:00
hiyouga
e90a1199da tiny fix
Former-commit-id: 3af57795dda5d236200bad4aa3f2e29ae8930fe2
2024-10-11 23:51:54 +08:00
Johnny
27be1e2122 Update parser.py
Former-commit-id: e5849cdcce109e15547edcf9a692e7c13d625e5a
2024-10-11 12:29:33 +02:00
hoshi-hiyouga
b855d3421e Update parser.py
Former-commit-id: 1ce0b42b1e30cb5419c91702a499f23d52db43ee
2024-10-07 16:27:23 +08:00
Johnny
059c2ffbea Update parser.py
Former-commit-id: 4e638777ebcbf7dea22011361fb341bafe6ba9d9
2024-10-07 10:17:45 +02:00
Johnny
9a6045eee6 Update parser.py
Former-commit-id: 6c1aef55604649a956fe928d89280626923815b8
2024-10-06 20:34:19 +02:00
hiyouga
56132983cf fix #5611
Former-commit-id: a45f3f5461e2936b9e119eda2ef4d8c7a4131740
2024-10-06 10:34:55 +08:00
hiyouga
78cf256067 support vllm 0.6.0
Former-commit-id: b6681d7198acf4acbebfe271dd22095e236bc430
2024-09-08 02:26:20 +08:00
hiyouga
f31e7e0dfc remove visual_inputs, fix qlora
Former-commit-id: a025c3df61db154bef13033518903bbf846f4fc8
2024-08-31 00:24:51 +08:00
hiyouga
a83756b5e9 refactor mm training
Former-commit-id: 3382317e32f88ed377d3e7759bdeaf0f2559d22a
2024-08-30 02:14:31 +08:00