482 Commits

Author SHA1 Message Date
Hertz
a71e685021
[model] support hunyuan 7b (#7317)
* [Model]supported tencent-hunyuan model

* [Model]supported tencent-hunyuan model(fix)

* [Model]supported tencent-hunyuan model(fix)
2025-03-15 20:55:24 +08:00
Qiaolin Yu
30038d9ce7
[inference] support sglang backend (#7278)
* Mimic SGLang offline Engine

* Add more tests and args

* Pass all current tests

* Clean Code

* fix sample_params

* clean code

* Fix Stream Chat

* change sglang from engine mode to server mode

* fix

* Fix Review Issues

* Use SGLang Built-In Utilities

* Fix test SGLang

* Some Doc Issue

* fix sglang engine

* add readme

---------

Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
2025-03-15 04:37:58 +08:00
hoshi-hiyouga
165d3ed084
[model] support gemma3 (#7273) 2025-03-13 01:35:23 +08:00
hoshi-hiyouga
317d0855d2 [infer] fix vllm args (#7235)
Former-commit-id: ef7af457fc44b1e8cad0c78717848617f98364f0
2025-03-11 01:15:35 +08:00
hoshi-hiyouga
5a29f49fb1 [config] update args (#7231)
Former-commit-id: ed8b12e3cbdaa85f5bde619081b86717a1f3c5fa
2025-03-10 23:04:43 +08:00
hoshi-hiyouga
5a0fd22c05 [assets] update readme (#7209)
Former-commit-id: cdf8fc647819523eca8139ba62a510c3078b694d
2025-03-07 17:27:49 +08:00
hoshi-hiyouga
e7556b591e [deps] upgrade vllm (#7183)
Former-commit-id: d739fddb10f51f422c12b1e7d5d7208309bf6c91
2025-03-06 15:25:08 +08:00
hoshi-hiyouga
54a090079c [assets] update wechat (#7106)
Former-commit-id: d1863bbbaa7748e74eab34fc5a56de1102933530
2025-02-28 12:01:04 +08:00
hoshi-hiyouga
f4aa0a146c [misc] fix project toml (#7067)
Former-commit-id: 96fd510e6a03eae7a1f41772e1d6b784df6d5d2e
2025-02-25 23:22:48 +08:00
hoshi-hiyouga
9359ee18ad [assets] update readme (#7051)
Former-commit-id: fe6dd92c84cff34326f6f0ff0b648e43c372fe96
2025-02-24 20:45:06 +08:00
hoshi-hiyouga
15f3087b96 [assets] update wechat (#7019)
Former-commit-id: 1481af5dc9bc99807ae0ee5a438bf0a279cafb66
2025-02-20 20:32:33 +08:00
hoshi-hiyouga
beb1a9f9d9 [data] add r1 distill dataset (#6983)
Former-commit-id: 2591a3fa8b37fed8011fb66b266ef15e18404756
2025-02-18 17:25:09 +08:00
hoshi-hiyouga
3fbd4848e8 [version] support transformers 449 (#6982)
* support transformers 449

* fix mm plugin

Former-commit-id: b00b290c07beb560a5af857ce64f4ce424831a2c
2025-02-18 17:05:40 +08:00
hoshi-hiyouga
9b852ebe25 [misc] update readme (#6918)
Former-commit-id: 8956c93d9b05ee929fde2da0eb2365684c01f563
2025-02-13 01:01:41 +08:00
hoshi-hiyouga
07aa7b71a3 [misc] update readme (#6917)
Former-commit-id: 499ea45d1f1ea7704ee82f58c35af123a6c2632b
2025-02-13 00:58:10 +08:00
hoshi-hiyouga
1b02183da9 [misc] update readme (#6903)
Former-commit-id: 18179a3823b6222973e7c8157d21ed88fdb8c550
2025-02-11 22:51:26 +08:00
hoshi-hiyouga
c6be9e242c [misc] support export ollama modelfile (#6899)
* support export ollama modelfile

* update config

* add system and num ctx

Former-commit-id: 9184a6e0ed7ff5f632c848f861bfa448c4cd06fc
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
703bb9cc18 [assets] update wechat (#6892)
Former-commit-id: fc5d47401f61a5c95d7a4ab7f696450cfa7b70e3
2025-02-11 13:56:26 +08:00
Zhangchi Feng
5433b318bb [da'ta] fix minicpmv plugin (#6890)
* fix template name

* tiny fix

* support minicpm-o-2.6

* support inference of minicpmv

* update readme

* support dpo of minicpmv

* update init audio

* update init audio

* [model]fix image process in minicpmo

* fix no mm inputs

Former-commit-id: 764627645abcd353f9130d5dd8c584810b0e0b1b
2025-02-11 13:30:44 +08:00
hoshi-hiyouga
fcd0f0480d [dataset] add openthought (#6866)
Former-commit-id: 1356f9d8400efaccf677d0b36aaf32a146a09833
2025-02-09 00:53:01 +08:00
Zhangchi Feng
01915eaf40 [model] support audio (#6701)
* support qwen2_audio

* improve code

* lint

* fix

* fix

* fix

---------

Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 24c78429489809873a1269a735ea5421340b32a2
2025-02-05 04:59:09 +08:00
neavo
32163e7ce0 [readme] update flash attention installation instruction on win platform (#6788)
* Update README_zh.md

* Update README.md

Former-commit-id: a417bcf8d9d70057d12f9152e4091c090133e025
2025-02-01 12:43:29 +08:00
hoshi-hiyouga
445d643ef3 [model] add mistral small models (#6786)
Former-commit-id: 94803d8133fbbadff6d224cb6695feb5434fd4fd
2025-02-01 04:31:38 +08:00
hoshi-hiyouga
e8c1979b79 [model] add qwen2.5 vl models (#6779)
Former-commit-id: 999c7c8fe0caf6b837a1bdc2c6a24fafec327cd8
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
f6779b0e0c [breaking] support transformers 4.48 (#6628)
Former-commit-id: 15357cdad953bba1f2d294819f56b9746ed1b891
2025-01-31 01:36:33 +08:00
hoshi-hiyouga
245de012ca [webui] improve webui & reasoning mode (#6778)
Former-commit-id: 45e68b9f092879dda55023ebbcd8cf4660e3045a
2025-01-31 00:09:21 +08:00
qvlehao
f5350b103b [model] add deepseek-R1 & show think process (#6767)
Former-commit-id: 28417f862a1947a24663150ca55f421198b6d8eb
2025-01-29 12:16:26 +08:00
hoshi-hiyouga
ee0b3b1e1a [assets] update wechat (#6692)
Former-commit-id: 17b470630df08786e958192287edc8199280a4b2
2025-01-18 12:35:03 +08:00
hoshi-hiyouga
320e40d873 update readme (#6648)
Former-commit-id: 563be2286a756fcd5d41b351beb8e1aa4e95842b
2025-01-15 11:06:19 +08:00
hoshi-hiyouga
9ef85f8fc4 [optim] clean apollo (#6645)
* clean apollo code

* update readme

Former-commit-id: 7a04021d0461caea2c7b82169839340b7f51f463
2025-01-15 01:42:50 +08:00
Zhangchi Feng
57043fb4e6 update readme of MiniCPM-o (#6642)
* fix template name

* tiny fix

* support minicpm-o-2.6

* support inference of minicpmv

* update readme

Former-commit-id: 9b7ba093c7e017ea18a4562550d5d2e82c4a0161
2025-01-14 21:22:35 +08:00
Haian Huang(深度眸)
864ee06243 Support InternLM3 Dense 8B Model (#6640)
* support internlm3

* update

* update

* update

* add hint

Former-commit-id: deacc00b1226ca3d53bf7bb1231cf276eaa8296b
2025-01-14 18:07:27 +08:00
Zhangchi Feng
201a495154 Support new features of MiniCPM-V (#6626)
* fix template name

* tiny fix

* support minicpm-o-2.6

Former-commit-id: c3fda5046d835ba4542d525b8d89cd12838e9f4c
2025-01-14 00:26:19 +08:00
hoshi-hiyouga
d8cba9464f [inference] fix stop token for object detection (#6624)
* fix stop token

* update minicpm data pipeline

* fix npu qlora examples

Former-commit-id: e3e2c8c689c54ebb2af264de808502e5a8ba0f2b
2025-01-13 21:34:20 +08:00
codingma
089c7d5e51 add nf4 qlora support on Ascend NPU (#6601)
* add nf4 qlora support on Ascend NPU

* add transformers version check

* add python>=3.10 requirement description for npu

* tiny fix

---------

Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 03de5ac912336190d6b3583f70b6340ab9cf9cdf
2025-01-13 19:43:36 +08:00
Zhangchi Feng
15bba15725 Fix template name of MiniCPM-V (#6620)
* fix template name

* tiny fix

Former-commit-id: 3077f20339158564009270edf79c8ef1b10e8b4a
2025-01-13 16:46:48 +08:00
fzc8578
9213e48fa2 add minicpmv2.6
Former-commit-id: e45329e7456b647d5684b1f9428641ad18af92d1
2025-01-10 23:45:44 +08:00
hiyouga
b471def13d improve template, add phi4 model
Former-commit-id: ae16ea755d581a5a288fb55f12481215f369b255
2025-01-09 18:27:54 +00:00
hiyouga
528fb4f799 update model name
Former-commit-id: 4b8add728729d8e2ce4c9a3dc6748357291d8e8b
2025-01-02 12:19:21 +00:00
hoshi-hiyouga
aa7ec44367 Merge pull request #6514 from hiyouga/hiyouga/add_project
[readme] add project

Former-commit-id: a766cad5d49f226eb61a550bc3d157870c1068cc
2025-01-02 20:16:15 +08:00
hiyouga
9a3afbd5d1 add project
Former-commit-id: b3e1137fbbdfa4cc081903983fea36acff7afd75
2025-01-02 12:15:41 +00:00
hiyouga
37c60c7d14 add gpt2 model
Former-commit-id: 67442bd497c75b0c5990d94a880e0e25474ae2fa
2025-01-02 12:07:38 +00:00
hiyouga
d0e729cd33 add deepseek3 model
Former-commit-id: e67b9dcc3ad0c003bc3afd7601ecd2adfbf9666b
2024-12-30 13:39:20 +00:00
hiyouga
c83b74ab9e add qvq #6439
Former-commit-id: ee0e400f417f648cd15cf48144df76e4809cc615
2024-12-25 07:52:41 +00:00
hiyouga
353259f03f update readme
Former-commit-id: 8fd38d273e5bc3b28a4741b230010fece87e7070
2024-12-23 14:08:59 +00:00
hoshi-hiyouga
8265d6a228 Merge pull request #5922 from Tuyohai/main
support granite3 models

Former-commit-id: c23a4d0658323434c386716c25855711202e37a9
2024-12-23 16:46:02 +08:00
hiyouga
47c2d91933 support report custom args
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
2024-12-21 21:42:45 +00:00
ZeYi Lin
1c1d6bea43 docs: use swanlab
Former-commit-id: 744ef8c2688efad82028e22683e6c9d874af6823
2024-12-21 20:59:25 +08:00
hiyouga
433d116080 add paligemma2
Former-commit-id: d3509050dc4d3105a6e62acc9a1ba481269279a2
2024-12-18 08:57:26 +00:00
hoshi-hiyouga
d43080b534 Merge pull request #6313 from ge-xing/main
support telechat2 model

Former-commit-id: 015f2137887bb9f27fcb0d6cc67ef729aad4031e
2024-12-18 16:16:17 +08:00