75 Commits

Author SHA1 Message Date
hoshi-hiyouga
60a84f664b
[deps] fix uv conflicts (#7686)
* fix #7678

* Update setup.py

* Update tests.yml

* Update publish.yml

* Update Makefile
2025-04-11 18:02:24 +08:00
hoshi-hiyouga
39876b85fc
[assets] update readme (#7644) 2025-04-09 01:06:06 +08:00
Adarsh Shirawalmath
8ee26642f3
[sglang] support transformers 4.51.0 (#7639) 2025-04-08 18:39:23 +08:00
hoshi-hiyouga
5817cda37e
[misc] fix packing and eval plot (#7623) 2025-04-07 18:20:57 +08:00
hoshi-hiyouga
aaf2e6ba2a
[model] fix kv cache (#7564) 2025-04-01 23:07:46 +08:00
Xiaosu Zhu
bc9ada9db7
[misc] update liger-kernel's monkey patch (#7453)
* Update liger_kernel.py

* Update setup.py
2025-03-25 11:58:52 +08:00
hoshi-hiyouga
dfbe1391e9
[deps] upgrade vllm to 0.8 (#7436) 2025-03-23 14:32:22 +08:00
Guo, Quan
ebc989ad4a
[misc] fix sglang deps (#7432)
* feat: Add transformer version requirement for sglang

* feat: add srt to sglang which is required for running sglang

Other options are srt_hip, srt_xpu, srt_npu, srt_hpu, srt_cpu, for different computation architectures.
2025-03-23 14:07:10 +08:00
hoshi-hiyouga
555b71a1cb
[version] fix minicpmo (#7378) 2025-03-20 16:59:31 +08:00
Qiaolin Yu
30038d9ce7
[inference] support sglang backend (#7278)
* Mimic SGLang offline Engine

* Add more tests and args

* Pass all current tests

* Clean Code

* fix sample_params

* clean code

* Fix Stream Chat

* change sglang from engine mode to server mode

* fix

* Fix Review Issues

* Use SGLang Built-In Utilities

* Fix test SGLang

* Some Doc Issue

* fix sglang engine

* add readme

---------

Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-03-15 04:37:58 +08:00
hoshi-hiyouga
7c1640ed5f
[misc] upgrade format to py39 (#7256) 2025-03-12 00:08:41 +08:00
hoshi-hiyouga
b256ca86f0
[core] release v0.9.2 (#7254) 2025-03-11 22:42:23 +08:00
hoshi-hiyouga
5a29f49fb1 [config] update args (#7231)
Former-commit-id: ed8b12e3cbdaa85f5bde619081b86717a1f3c5fa
2025-03-10 23:04:43 +08:00
hoshi-hiyouga
e7556b591e [deps] upgrade vllm (#7183)
Former-commit-id: d739fddb10f51f422c12b1e7d5d7208309bf6c91
2025-03-06 15:25:08 +08:00
leo-pony
e86cb8a4fa [npu] update cann base image and torch 2.4 (#7061)
* Update base npu container image version: the Python version required for Hugging Face Transformers is >= python3.10

* Fix the bug: the arg type of INSTALL_DEEPSPEED should be a string now.

* Update Ascend CANN, CANN-Kernel and corresponding torch and torch-npu version

* Upgrade the versions of the packages torch-npu needs: torch==2.1.0 and torch-npu==2.4.0.post2

Former-commit-id: acc52e0fe75d59c2581ef45346ebfbffe45f762b
2025-02-25 23:32:01 +08:00
hoshi-hiyouga
ff6658ad27 [deps] upgrade vllm (#6857)
Former-commit-id: 5f38bcaba921dbdee27b4be4709fcec06fa37c9e
2025-02-08 15:02:28 +08:00
Zhangchi Feng
01915eaf40 [model] support audio (#6701)
* support qwen2_audio

* improve code

* lint

* fix

* fix

* fix

---------

Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 24c78429489809873a1269a735ea5421340b32a2
2025-02-05 04:59:09 +08:00
hoshi-hiyouga
1fee69f874 [misc] update license year & fix llama pro (#6814)
* fix llamapro script

* change year

Former-commit-id: e2dc5b952aa22835d5220ba624f44676138b65ac
2025-02-05 01:53:33 +08:00
hoshi-hiyouga
f6779b0e0c [breaking] support transformers 4.48 (#6628)
Former-commit-id: 15357cdad953bba1f2d294819f56b9746ed1b891
2025-01-31 01:36:33 +08:00
zhuHQ
763f9b9df0 [optim] add support to APOLLO (#6617)
Former-commit-id: d9189f9f0b23ff6929044919208e0e813ca95b1c
2025-01-15 00:24:56 +08:00
Zhangchi Feng
ad119afc58 Support Inference of MiniCPM-V-2.6 and MiniCPM-o-2.6 (#6631)
* fix template name

* tiny fix

* support minicpm-o-2.6

* support inference of minicpmv

Former-commit-id: 158a127d340d5e4ca23263ffad042f861fd77deb
2025-01-14 17:34:58 +08:00
hoshi-hiyouga
5e699458e5 pin vllm version to 0.6.5 (#6629)
Former-commit-id: 1c7663d3049e00a9148c3e3c58204deca7a08c8d
2025-01-14 02:44:02 +08:00
Zhangchi Feng
201a495154 Support new features of MiniCPM-V (#6626)
* fix template name

* tiny fix

* support minicpm-o-2.6

Former-commit-id: c3fda5046d835ba4542d525b8d89cd12838e9f4c
2025-01-14 00:26:19 +08:00
hiyouga
c89d17ab63 refactor mllm param logic
Former-commit-id: f6f630a1c96514053176abb12e35a06242e62abd
2025-01-10 15:45:48 +00:00
hiyouga
813f5919a3 fix #6482
Former-commit-id: 6f5bb3b8e5b6eb7fdfd7b0ca8eba789ab741a7b6
2024-12-30 06:03:07 +00:00
hiyouga
47c2d91933 support report custom args
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
2024-12-21 21:42:45 +00:00
hiyouga
235cdcacee support batch infer in vllm
Former-commit-id: 1324d158f954d777f1fbf09f46149c372704b388
2024-12-04 13:50:00 +00:00
hiyouga
0d18cca0db add vllm config
Former-commit-id: 58ab4579dc81a1dcea2bf5938ba3f3116cecfc76
2024-11-10 21:28:18 +08:00
hiyouga
0d8aa6e6ef use pre-commit
Former-commit-id: 21db8ed2f4a0eba203754a92ce0741538e8ee709
2024-10-29 09:07:46 +00:00
hiyouga
e90a1199da tiny fix
Former-commit-id: 3af57795dda5d236200bad4aa3f2e29ae8930fe2
2024-10-11 23:51:54 +08:00
Johnny
39929bda5a Update setup.py
Former-commit-id: cb56673675a9d2ba359dd6d9a561084b9bad7bfd
2024-10-11 12:29:09 +02:00
hoshi-hiyouga
ef87942a42 Update setup.py
Former-commit-id: c2e947de28841c1125991c4ebd60c13f763bf027
2024-10-07 16:26:50 +08:00
Johnny
93f14bf121 Update setup.py
Former-commit-id: 6b78d02175c3e03f7304aed64b9fa21e0a71dd0d
2024-10-07 10:16:53 +02:00
Johnny
332614579c Update setup.py
Former-commit-id: e57f607cb4a6a59b695483b760a0b7e3b762695a
2024-10-06 08:53:55 +02:00
hiyouga
3aefdad4ec release v0.9.0 (real)
Former-commit-id: 90d6df622252c6fad985f68b97771c979357e2fc
2024-09-09 01:00:25 +08:00
hiyouga
78cf256067 support vllm 0.6.0
Former-commit-id: b6681d7198acf4acbebfe271dd22095e236bc430
2024-09-08 02:26:20 +08:00
hiyouga
53aeacae81 fix #5252
Former-commit-id: ebddce5d4a2ebb40ce71462f81701e274eb3c29f
2024-09-04 03:17:54 +08:00
hiyouga
c2df70e925 add extra requires
Former-commit-id: d14edd350ddf268cfeea0f4e9e3c43f38516b848
2024-08-27 12:52:12 +08:00
hiyouga
5eacd17090 add adam_mini to readme
Former-commit-id: e2a28f51c635d64ff9de65a37087d89356bdedcc
2024-08-09 20:02:03 +08:00
hiyouga
d2d9fa4abb support HQQ/EETQ #4113
Former-commit-id: ad144c2265cdee0d23014dbb3d017ea257cb26ed
2024-06-27 00:29:42 +08:00
hiyouga
6b2733ce12 improve autogptq integration
Former-commit-id: addca926de42f91366185a47eb8e777ed44a8e77
2024-06-26 22:11:44 +08:00
hiyouga
a225b5a70c tiny fix about badam
Former-commit-id: 095fab58d3692607c9e78747b4218ae1abcf5aaf
2024-06-25 01:54:53 +08:00
hiyouga
2946153cea add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
2024-06-15 17:54:33 +08:00
hiyouga
19ea51e460 fix #4271
Former-commit-id: f1aa6a411a55efea1064ea74b936ac8a3d856f8c
2024-06-15 05:11:33 +08:00
hiyouga
d3eb985bb6 fix ci
Former-commit-id: 7f20e4722ae6ac907b36a3219dcd09d2ff5d071a
2024-06-08 01:57:36 +08:00
hiyouga
2f0a333e9c update readme
Former-commit-id: 53eb2de75e2df372b87801cea4ccafd6e73e59df
2024-06-06 16:59:18 +08:00
hiyouga
8f3b8ade45 fix setup
Former-commit-id: 4dc0632145f555ac564b0d6b560d28cbcb3ddded
2024-06-06 01:39:02 +08:00
hiyouga
ecd06d0110 fix #4079
Former-commit-id: 83a005e3d404f5a8ccb7b8ac17c50db75df4e8d4
2024-06-05 16:56:54 +08:00
hiyouga
d0ceb1b091 fix #4022
Former-commit-id: 24e1c0e2ee365812357fb9cf82e2162915299a26
2024-06-03 18:38:36 +08:00
hiyouga
a71a6a05c3 update readme
Former-commit-id: 89ca832740731dfb121175aa5c16b13bd4944011
2024-05-29 18:39:11 +08:00