hiyouga
5c00783697
update hardware requirements
...
Former-commit-id: 393c2de27c
2024-03-09 03:58:18 +08:00
hiyouga
eb363b04b9
update examples
...
Former-commit-id: 4c00bcdcae
2024-03-09 02:30:37 +08:00
hiyouga
c561b268ef
fix #2756 , patch #2746
...
Former-commit-id: e8dd38b7fd
2024-03-09 02:01:26 +08:00
hoshi-hiyouga
36d65289d0
Merge pull request #2746 from stephen-nju/main
...
fix deepspeed ppo RuntimeError
Former-commit-id: 516d0ddc66
2024-03-09 01:37:00 +08:00
hiyouga
247aab9066
Update setup.py
...
Former-commit-id: 74ff8664d7
2024-03-09 00:14:48 +08:00
hiyouga
398c261c7c
fix aqlm version
...
Former-commit-id: 10be2f0ecc
2024-03-09 00:09:09 +08:00
hiyouga
ccec17f773
fix example params
...
Former-commit-id: 8a45213440
2024-03-08 20:41:43 +08:00
stephen_zhu
c69b9fbe58
update
...
Former-commit-id: aa71571b77
2024-03-08 12:47:44 +08:00
stephen
495b858606
fix ppo runtime error
...
Former-commit-id: cdb7f82869
2024-03-08 11:48:26 +08:00
S3Studio
de41334055
Add dockerize support
...
Already tested with the model of Qwen:1.8B and the dataset of alpaca_data_zh. Some python libraries are added to the Dockerfile as a result of the exception messages displayed throughout test procedure.
Former-commit-id: 3d911ae713
2024-03-08 10:47:28 +08:00
hiyouga
b268215a0e
update readme
...
Former-commit-id: 4a2cc60b94
2024-03-08 03:06:21 +08:00
hiyouga
7443ac3116
fix chat engine, update webui
...
Former-commit-id: 5d956e2a51
2024-03-08 03:01:53 +08:00
hiyouga
0a0959facf
Update setup.py
...
Former-commit-id: 5cd4947650
2024-03-08 01:23:00 +08:00
hiyouga
2235020cc9
update galore args
...
Former-commit-id: 0ac6b40a47
2024-03-08 01:17:32 +08:00
hiyouga
5b50458acf
fix galore
...
Former-commit-id: 33a4c24a8a
2024-03-08 00:44:51 +08:00
hiyouga
f373290012
add Yi-9B model
...
Former-commit-id: 57452a4aa1
2024-03-07 23:11:57 +08:00
hiyouga
cb2bf680c9
add galore examples
...
Former-commit-id: 7230e1177d
2024-03-07 22:53:45 +08:00
hiyouga
2c010c72b8
support galore
...
Former-commit-id: 28f7862188
2024-03-07 22:41:36 +08:00
hiyouga
1af71f548c
update readme
...
Former-commit-id: 725f7cd70f
2024-03-07 20:34:49 +08:00
hiyouga
583d956bda
tiny fix
...
Former-commit-id: 77211d9843
2024-03-07 20:29:34 +08:00
hoshi-hiyouga
86ba1a5c5b
Merge pull request #2739 from hiyouga/dev-vllm
...
support vllm
Former-commit-id: a0dc721816
2024-03-07 20:28:18 +08:00
hiyouga
34533b2f35
support vllm
...
Former-commit-id: d07ad5cc1c
2024-03-07 20:26:31 +08:00
hiyouga
37e40563f1
fix #2735
...
Former-commit-id: f74f804a71
2024-03-07 16:15:53 +08:00
hoshi-hiyouga
90e66c8d94
Merge pull request #2730 from cx2333-gt/main
...
fix flash_attn in train_web
Former-commit-id: 2185855bdb
2024-03-07 14:37:18 +08:00
cx2333
013c12a135
revert choice name
...
Former-commit-id: 94b7a1b915
2024-03-07 14:28:55 +08:00
hiyouga
843d3f7a97
fix chatglm3 template
...
Former-commit-id: 921ee82267
2024-03-07 14:26:16 +08:00
hiyouga
a5dcb4fcf4
Update wechat.jpg
...
Former-commit-id: 08d7dc06f2
2024-03-07 13:14:10 +08:00
cx2333
22624e566e
fix flash_attn in train_web
...
Former-commit-id: a8889498fa
2024-03-07 10:13:55 +08:00
hiyouga
31c618f1f7
tiny fix
...
Former-commit-id: 0048a2021e
2024-03-06 17:25:08 +08:00
hiyouga
8b6c178249
export use balanced gpu
...
Former-commit-id: 3e84f430b1
2024-03-06 16:33:14 +08:00
hiyouga
8b21a60d9c
fix add tokens
...
Former-commit-id: 9658c63cd9
2024-03-06 15:04:02 +08:00
hiyouga
e887aface7
fix version checking
...
Former-commit-id: 3016e65657
2024-03-06 14:51:51 +08:00
hiyouga
8d386775f2
update examples
...
Former-commit-id: d1587c80de
2024-03-06 13:14:57 +08:00
hiyouga
af526c3a46
fix arg dtype
...
Former-commit-id: e0c47358f9
2024-03-05 20:53:30 +08:00
hiyouga
9561809ce9
improve aqlm optim
...
Former-commit-id: 259af60d28
2024-03-05 20:49:50 +08:00
hiyouga
c776cdfc3e
optimize aqlm training
...
Former-commit-id: d3d3dac707
2024-03-05 18:35:41 +08:00
hiyouga
0f2250b831
fix dora inference
...
Former-commit-id: ddf352f861
2024-03-05 11:51:41 +08:00
hiyouga
768358b960
fix export model
...
Former-commit-id: e5edcf440f
2024-03-05 11:05:41 +08:00
hiyouga
02eac3fd09
update readme
...
Former-commit-id: df9e6bb063
2024-03-05 03:20:23 +08:00
hiyouga
8cf9842f7a
add examples
...
Former-commit-id: 76f31b18eb
2024-03-05 03:16:35 +08:00
hiyouga
eeb34e5696
auto set chat template
...
Former-commit-id: 9e56eaf2d3
2024-03-05 02:41:20 +08:00
hiyouga
b04316d9a8
update readme
...
Former-commit-id: 24a79bd50f
2024-03-04 19:29:26 +08:00
hiyouga
a62d17d009
fix export on cpu device
...
Former-commit-id: cda2ff8727
2024-03-04 17:35:09 +08:00
hiyouga
0e58cd6422
fix sub-process error in thread
...
Former-commit-id: 9c10854b46
2024-03-03 15:04:35 +08:00
hiyouga
d966aee105
update readme
...
Former-commit-id: 7c227e07dd
2024-03-03 01:41:07 +08:00
hiyouga
9ae1514a75
update readme, add starcoder2, cosmopedia
...
Former-commit-id: 894d183214
2024-03-03 01:01:46 +08:00
hoshi-hiyouga
b55880bf28
Update README_zh.md
...
Former-commit-id: 1006f372ae
2024-03-03 00:49:08 +08:00
hoshi-hiyouga
dcd0d92978
Update README.md
...
Former-commit-id: 4bf7eb72e0
2024-03-03 00:48:47 +08:00
hoshi-hiyouga
5f9b1ad80c
Update README.md
...
Former-commit-id: 585c884ea9
2024-03-03 00:48:06 +08:00
hiyouga
caa6aa9dc5
add colab demo
...
Former-commit-id: 318315c76d
2024-03-02 19:58:21 +08:00