hoshi-hiyouga
|
86ebb219d6
|
[breaking] bump transformers to 4.45.0 & improve ci (#7746)
* update ci
* fix
* fix
* fix
* fix
* fix
|
2025-04-17 02:36:48 +08:00 |
|
hoshi-hiyouga
|
1134baeedd
|
[assets] update model readme (#7724)
|
2025-04-15 00:41:09 +08:00 |
|
hoshi-hiyouga
|
3f91a95250
|
[misc] fix env vars (#7715)
|
2025-04-14 16:04:04 +08:00 |
|
hoshi-hiyouga
|
f518bfba5b
|
[deps] upgrade transformers (#7704)
|
2025-04-13 18:11:34 +08:00 |
|
jilongW
|
1b0934bccb
|
[misc] fix cuda warn on intel GPU (#7655)
|
2025-04-09 21:37:54 +08:00 |
|
hoshi-hiyouga
|
4eec541857
|
[data] add coig-p dataset (#7657)
|
2025-04-09 21:18:25 +08:00 |
|
hoshi-hiyouga
|
1abd71b551
|
[assets] update readme (#7644)
|
2025-04-09 01:06:06 +08:00 |
|
hoshi-hiyouga
|
831e7f1cfd
|
[model] add llama4 (#7611)
|
2025-04-06 13:42:31 +08:00 |
|
hoshi-hiyouga
|
2bfcad2394
|
[model] fix kv cache (#7564)
|
2025-04-01 23:07:46 +08:00 |
|
hoshi-hiyouga
|
0583d06676
|
[model] add qwen2vl 32b & upgrade peft (#7469)
* add qwen2vl 32b
* fix ci
* upgrade peft to 0.15
* fix ci
* fix ci
|
2025-03-25 12:15:58 +08:00 |
|
GuoCoder
|
ec6a261568
|
[model] fix lora on quant models (#7456)
Co-authored-by: root <root@ai>
|
2025-03-25 11:59:46 +08:00 |
|
hoshi-hiyouga
|
05b19d6952
|
[deps] upgrade transformers to 4.50.0 (#7437)
* upgrade transformers
* fix hf cache
* fix dpo trainer
|
2025-03-23 17:44:27 +08:00 |
|
Qiaolin Yu
|
a44a53ebec
|
[inference] support sglang backend (#7278)
* Mimic SGLang offline Engine
* Add more tests and args
* Pass all current tests
* Clean Code
* fix sample_params
* clean code
* Fix Stream Chat
* change sglang from engine mode to server mode
* fix
* Fix Review Issues
* Use SGLang Built-In Utilities
* Fix test SGLang
* Some Doc Issue
* fix sglang engine
* add readme
---------
Co-authored-by: Jin Pan <jpan236@wisc.edu>
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
|
2025-03-15 04:37:58 +08:00 |
|
hoshi-hiyouga
|
650a9a9057
|
[misc] update format (#7277)
|
2025-03-13 02:53:08 +08:00 |
|
hoshi-hiyouga
|
e6159ad730
|
[misc] upgrade deps (#7257)
|
2025-03-12 00:33:47 +08:00 |
|
hoshi-hiyouga
|
264538cb26
|
[misc] upgrade format to py39 (#7256)
|
2025-03-12 00:08:41 +08:00 |
|
hoshi-hiyouga
|
1d675a287d
|
[version] support transformers 449 (#6982)
* support transformers 449
* fix mm plugin
Former-commit-id: e9118a9df0839d24f6ddff5a0b55ef101a1d3d22
|
2025-02-18 17:05:40 +08:00 |
|
hoshi-hiyouga
|
4d1791e905
|
[deps] upgrade vllm (#6857)
Former-commit-id: 4bd50f65a3d62528768561019fda2723d045c7fd
|
2025-02-08 15:02:28 +08:00 |
|
hoshi-hiyouga
|
a28261a866
|
[model] add mistral small models (#6786)
Former-commit-id: e5e95c39bc4199fa89c67e34f9adaaa987058744
|
2025-02-01 04:31:38 +08:00 |
|
hoshi-hiyouga
|
222423bcef
|
[breaking] support transformers 4.48 (#6628)
Former-commit-id: f154ab175c513a4d7bb866bf2cffc34b77b50508
|
2025-01-31 01:36:33 +08:00 |
|
hiyouga
|
647c51a772
|
imporve log
Former-commit-id: a6abf375975ffea3d51e1b944c9855b5f62ffac8
|
2025-01-08 09:56:10 +00:00 |
|
hiyouga
|
944a2aec4d
|
refactor ray integration, support save ckpt
Former-commit-id: 2f50b27e608b2092bfceab6c6e84e6631e973ee2
|
2025-01-07 09:39:10 +00:00 |
|
hiyouga
|
f8f05a883b
|
fix #6482
Former-commit-id: 8577f52b4152efe6cc7a8b5f6d37b4f9ba6684e7
|
2024-12-30 06:03:07 +00:00 |
|
hiyouga
|
c1768cfb14
|
support batch infer in vllm
Former-commit-id: 3ef5ed3b9a44eed2f7e3ff221dfc343d0a97c0b5
|
2024-12-04 13:50:00 +00:00 |
|
Ting
|
87b1f851f1
|
code refactor
Former-commit-id: ee3f85aa9677d0aeecb3bc396530d2cd7c50dce5
|
2024-11-19 20:33:18 +08:00 |
|
hiyouga
|
b104739d63
|
update datasets version
Former-commit-id: feba2c6418a15715fee77a34428fa3cf47fcee5b
|
2024-11-04 07:52:26 +00:00 |
|
hiyouga
|
093eda2ad6
|
support rank0 logger
Former-commit-id: 84528eabe560091bfd866b6a0ca864085af7529b
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
8185eb1890
|
fix incorrect loss value for vlms
Former-commit-id: 0aa29a71ce958343a2086090d647eb63b8f5f5be
|
2024-10-30 08:56:46 +00:00 |
|
hiyouga
|
8f5921692e
|
update requires
Former-commit-id: cae0e688ddcead370821e126c192bddc53ff6017
|
2024-10-29 16:10:07 +08:00 |
|
hiyouga
|
c7efc7f2ed
|
tiny fix
Former-commit-id: 1fe424323b212094856f423351dc2a15774d39c3
|
2024-10-11 23:51:54 +08:00 |
|
huniu20
|
a6951db970
|
bugs fixed
Former-commit-id: 5457ba7512d70564ea784b9ec6bdb86cfd2d7e3d
|
2024-10-11 19:56:13 +08:00 |
|
huniu20
|
c42dcab32b
|
1. add modelers hub support
Former-commit-id: 14678eb444d8181176745d18d4a6865fd6860f58
|
2024-10-09 17:21:37 +08:00 |
|
hiyouga
|
b2dc6dc59a
|
tiny fix
Former-commit-id: d8ddd07c2ed14d871fb25743c20265fc99e3e221
|
2024-10-08 17:48:56 +08:00 |
|
hiyouga
|
588ea95732
|
update accelerate ver for schedule_free optimizers
Former-commit-id: 2de74e79049ce8e50f605f649275b1dbfb899c8c
|
2024-09-09 22:51:08 +08:00 |
|
hiyouga
|
7e4c5d4bb3
|
fix mixed mm inputs and rlhf-v
Former-commit-id: 7c248fac20bf85d57a91132ce7a793c7f84e9218
|
2024-09-01 20:52:47 +08:00 |
|
hiyouga
|
c62a6ca59d
|
refactor mm training
Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
7c6785d3df
|
fix #5295
Former-commit-id: c76873b0eb8225f6e6bfc7223c6012387dceb8ed
|
2024-08-29 20:30:18 +08:00 |
|
hiyouga
|
ca5a759f94
|
tiny fix
Former-commit-id: d2cede7023bbe28525ef8b4ad27247445d8c22e5
|
2024-08-27 12:49:32 +08:00 |
|
hiyouga
|
d111a324bc
|
tiny fix
Former-commit-id: 23961bdf6fdbcde64e7b943f699fdeb4ac024043
|
2024-08-20 00:10:52 +08:00 |
|
hoshi-hiyouga
|
525747b472
|
Merge pull request #5188 from Zxilly/main
fix: report correct device count for intel xpu
Former-commit-id: cd3c536cb3936061d905256850b0e57df4498010
|
2024-08-19 23:51:39 +08:00 |
|
Ricardo
|
57d4c4a4f8
|
_is_bf16_available judgment supports npu
Former-commit-id: 50a1e892a1005b4cdd82dca1005f71db08ed89a2
|
2024-08-16 02:58:22 +00:00 |
|
Zxilly
|
3595d26846
|
fix: report correct device count for intel xpu
Former-commit-id: 0618f660b6511599365bd9be64499dbab41a79ba
|
2024-08-15 08:30:43 +00:00 |
|
hiyouga
|
13093963b1
|
fix #5048
Former-commit-id: 71a6861667ae68c1fd6a69acf68e1359b858cf1b
|
2024-08-05 23:48:19 +08:00 |
|
hiyouga
|
8c93921952
|
support batch_eval_metrics, fix #4826
Former-commit-id: 3fe1df17188825f8a32fbe6a1294b4b532ce0c85
|
2024-07-17 00:33:00 +08:00 |
|
hiyouga
|
188b4be64d
|
fix #4398 #4592
Former-commit-id: 8c92d268903c00392c8bd75a731daa1f107d6202
|
2024-06-30 21:28:51 +08:00 |
|
hiyouga
|
46f0189e88
|
refactor pissa, improve llamaboard
Former-commit-id: 619556e46c19718f702c97df5d570a2a4c5fb13a
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
7f90b0cd20
|
add tests
Former-commit-id: 484634ee9c982e82e919ff67d507e0210345182d
|
2024-06-15 19:51:20 +08:00 |
|
hiyouga
|
bb88536166
|
add license
Former-commit-id: 69cfc98d7c81756a5ab6bf962240e393e449fef0
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
d0edcde4ea
|
fix #4120
Former-commit-id: 2a44da678a5e360a9c0f9056397ac9e801329321
|
2024-06-07 04:18:05 +08:00 |
|
hiyouga
|
80f716bc10
|
fix torch gc
Former-commit-id: e173799d057598e5692a407601c30d8ce1513461
|
2024-06-06 20:30:25 +08:00 |
|