Yaowei Zheng
|
55590f5ece
|
[misc] fix ci with uv (#9676)
|
2025-12-27 01:39:13 +08:00 |
|
Yaowei Zheng
|
6ef9854713
|
[misc] fix cache & pin transformers to 4.57.1 (#9638)
|
2025-12-22 00:20:55 +08:00 |
|
Yaowei Zheng
|
aeda079014
|
[v1] model loader (#9613)
|
2025-12-14 11:50:52 +08:00 |
|
Yaowei Zheng
|
eaf963f67f
|
[model] update kt code (#9406)
|
2025-11-05 15:27:22 +08:00 |
|
Peilin Li
|
934b3084ee
|
[train] KTransformers SFT as backend engine for LLaMA-Factory (#9400)
Co-authored-by: jimmy128 <jimmy128@noreply.gitcode.com>
Co-authored-by: Yaowei Zheng <hiyouga@buaa.edu.cn>
|
2025-11-04 15:54:12 +08:00 |
|
Yaowei Zheng
|
1037f63311
|
[model] add qwen3vl 4b + 8b (#9275)
|
2025-10-15 15:00:36 +08:00 |
|
Yaowei Zheng
|
af8437095a
|
[ci] Change macOS version (#9229)
|
2025-10-05 02:18:30 +08:00 |
|
Yaowei Zheng
|
6ffebe5ff7
|
[data] fix qwen omni plugin (#9204)
Co-authored-by: kingsley <kingsleydodonow@gmail.com>
|
2025-09-28 01:02:29 +08:00 |
|
Yaowei Zheng
|
52488ac974
|
[deps] upgrade transformers to 4.56.1 (#9128)
|
2025-09-14 02:26:39 +08:00 |
|
Yaowei Zheng
|
4dfad24902
|
[model] add gpt oss (#8826)
|
2025-08-06 05:56:46 +08:00 |
|
Yaowei Zheng
|
7f8e5f52f9
|
[webui] fix abort finish (#8569)
|
2025-07-07 23:07:46 +08:00 |
|
Yaowei Zheng
|
12ed792db9
|
[webui] support other hub (#8567)
|
2025-07-07 22:18:48 +08:00 |
|
Yaowei Zheng
|
4b0ec83928
|
[deps] bump transformers to 4.49.0 (#8564)
|
2025-07-07 20:31:50 +08:00 |
|
Yaowei Zheng
|
4407231a3b
|
[webui] upgrade webui and fix api (#8460)
|
2025-06-25 21:59:58 +08:00 |
|
Yaowei Zheng
|
f276b9a963
|
[model] do not force load processor (#8457)
|
2025-06-25 19:43:00 +08:00 |
|
Yaowei Zheng
|
fee2122f09
|
[deps] upgrade transformers to 4.52.4 (#8245)
|
2025-05-31 16:51:40 +08:00 |
|
hoshi-hiyouga
|
ba032828e2
|
[deps] upgrade transformers (#8159)
|
2025-05-26 22:03:58 +08:00 |
|
hoshi-hiyouga
|
9ae17cd173
|
[deps] update to transformers 4.52 (#8125)
|
2025-05-21 05:16:18 +08:00 |
|
hoshi-hiyouga
|
45030ff803
|
[model] switch to gptqmodel (#8108)
|
2025-05-19 22:25:40 +08:00 |
|
hoshi-hiyouga
|
39169986ef
|
[trainer] fix pt loss (#7748)
* fix pt loss
* robust
* fix
* test
|
2025-04-17 03:15:35 +08:00 |
|
hoshi-hiyouga
|
86ebb219d6
|
[breaking] bump transformers to 4.45.0 & improve ci (#7746)
* update ci
* fix
* fix
* fix
* fix
* fix
|
2025-04-17 02:36:48 +08:00 |
|
hoshi-hiyouga
|
1134baeedd
|
[assets] update model readme (#7724)
|
2025-04-15 00:41:09 +08:00 |
|
hoshi-hiyouga
|
3f91a95250
|
[misc] fix env vars (#7715)
|
2025-04-14 16:04:04 +08:00 |
|
hoshi-hiyouga
|
f518bfba5b
|
[deps] upgrade transformers (#7704)
|
2025-04-13 18:11:34 +08:00 |
|
jilongW
|
1b0934bccb
|
[misc] fix cuda warn on intel GPU (#7655)
|
2025-04-09 21:37:54 +08:00 |
|
hoshi-hiyouga
|
4eec541857
|
[data] add coig-p dataset (#7657)
|
2025-04-09 21:18:25 +08:00 |
|
hoshi-hiyouga
|
1abd71b551
|
[assets] update readme (#7644)
|
2025-04-09 01:06:06 +08:00 |
|
hoshi-hiyouga
|
831e7f1cfd
|
[model] add llama4 (#7611)
|
2025-04-06 13:42:31 +08:00 |
|
hoshi-hiyouga
|
2bfcad2394
|
[model] fix kv cache (#7564)
|
2025-04-01 23:07:46 +08:00 |
|
hoshi-hiyouga
|
0583d06676
|
[model] add qwen2vl 32b & upgrade peft (#7469)
* add qwen2vl 32b
* fix ci
* upgrade peft to 0.15
* fix ci
* fix ci
|
2025-03-25 12:15:58 +08:00 |
|
GuoCoder
|
ec6a261568
|
[model] fix lora on quant models (#7456)
Co-authored-by: root <root@ai>
|
2025-03-25 11:59:46 +08:00 |
|
hoshi-hiyouga
|
05b19d6952
|
[deps] upgrade transformers to 4.50.0 (#7437)
* upgrade transformers
* fix hf cache
* fix dpo trainer
|
2025-03-23 17:44:27 +08:00 |
|
Qiaolin Yu
|
a44a53ebec
|
[inference] support sglang backend (#7278)
* Mimic SGLang offline Engine
* Add more tests and args
* Pass all current tests
* Clean Code
* fix sample_params
* clean code
* Fix Stream Chat
* change sglang from engine mode to server mode
* fix
* Fix Review Issues
* Use SGLang Built-In Utilities
* Fix test SGLang
* Some Doc Issue
* fix sglang engine
* add readme
---------
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
|
2025-03-15 04:37:58 +08:00 |
|
hoshi-hiyouga
|
650a9a9057
|
[misc] update format (#7277)
|
2025-03-13 02:53:08 +08:00 |
|
hoshi-hiyouga
|
e6159ad730
|
[misc] upgrade deps (#7257)
|
2025-03-12 00:33:47 +08:00 |
|
hoshi-hiyouga
|
264538cb26
|
[misc] upgrade format to py39 (#7256)
|
2025-03-12 00:08:41 +08:00 |
|
hoshi-hiyouga
|
1d675a287d
|
[version] support transformers 449 (#6982)
* support transformers 449
* fix mm plugin
Former-commit-id: e9118a9df0839d24f6ddff5a0b55ef101a1d3d22
|
2025-02-18 17:05:40 +08:00 |
|
hoshi-hiyouga
|
4d1791e905
|
[deps] upgrade vllm (#6857)
Former-commit-id: 4bd50f65a3d62528768561019fda2723d045c7fd
|
2025-02-08 15:02:28 +08:00 |
|
hoshi-hiyouga
|
a28261a866
|
[model] add mistral small models (#6786)
Former-commit-id: e5e95c39bc4199fa89c67e34f9adaaa987058744
|
2025-02-01 04:31:38 +08:00 |
|
hoshi-hiyouga
|
222423bcef
|
[breaking] support transformers 4.48 (#6628)
Former-commit-id: f154ab175c513a4d7bb866bf2cffc34b77b50508
|
2025-01-31 01:36:33 +08:00 |
|
hiyouga
|
647c51a772
|
improve log
Former-commit-id: a6abf375975ffea3d51e1b944c9855b5f62ffac8
|
2025-01-08 09:56:10 +00:00 |
|
hiyouga
|
944a2aec4d
|
refactor ray integration, support save ckpt
Former-commit-id: 2f50b27e608b2092bfceab6c6e84e6631e973ee2
|
2025-01-07 09:39:10 +00:00 |
|
hiyouga
|
f8f05a883b
|
fix #6482
Former-commit-id: 8577f52b4152efe6cc7a8b5f6d37b4f9ba6684e7
|
2024-12-30 06:03:07 +00:00 |
|
hiyouga
|
c1768cfb14
|
support batch infer in vllm
Former-commit-id: 3ef5ed3b9a44eed2f7e3ff221dfc343d0a97c0b5
|
2024-12-04 13:50:00 +00:00 |
|
Ting
|
87b1f851f1
|
code refactor
Former-commit-id: ee3f85aa9677d0aeecb3bc396530d2cd7c50dce5
|
2024-11-19 20:33:18 +08:00 |
|
hiyouga
|
b104739d63
|
update datasets version
Former-commit-id: feba2c6418a15715fee77a34428fa3cf47fcee5b
|
2024-11-04 07:52:26 +00:00 |
|
hiyouga
|
093eda2ad6
|
support rank0 logger
Former-commit-id: 84528eabe560091bfd866b6a0ca864085af7529b
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
8185eb1890
|
fix incorrect loss value for vlms
Former-commit-id: 0aa29a71ce958343a2086090d647eb63b8f5f5be
|
2024-10-30 08:56:46 +00:00 |
|
hiyouga
|
8f5921692e
|
update requires
Former-commit-id: cae0e688ddcead370821e126c192bddc53ff6017
|
2024-10-29 16:10:07 +08:00 |
|
hiyouga
|
c7efc7f2ed
|
tiny fix
Former-commit-id: 1fe424323b212094856f423351dc2a15774d39c3
|
2024-10-11 23:51:54 +08:00 |
|