Commit Graph

978 Commits

Author SHA1 Message Date
hiyouga
5cd4947650 Update setup.py 2024-03-08 01:23:00 +08:00
hiyouga
0ac6b40a47 update galore args 2024-03-08 01:17:32 +08:00
hiyouga
33a4c24a8a fix galore 2024-03-08 00:44:51 +08:00
hiyouga
57452a4aa1 add Yi-9B model 2024-03-07 23:11:57 +08:00
hiyouga
7230e1177d add galore examples 2024-03-07 22:53:45 +08:00
hiyouga
28f7862188 support galore 2024-03-07 22:41:36 +08:00
hiyouga
725f7cd70f update readme 2024-03-07 20:34:49 +08:00
hiyouga
77211d9843 tiny fix 2024-03-07 20:29:34 +08:00
hoshi-hiyouga
a0dc721816 Merge pull request #2739 from hiyouga/dev-vllm
support vllm
2024-03-07 20:28:18 +08:00
hiyouga
d07ad5cc1c support vllm 2024-03-07 20:26:31 +08:00
hiyouga
f74f804a71 fix #2735 2024-03-07 16:15:53 +08:00
hoshi-hiyouga
2185855bdb Merge pull request #2730 from cx2333-gt/main
fix flash_attn in train_web
2024-03-07 14:37:18 +08:00
cx2333
94b7a1b915 revert choice name 2024-03-07 14:28:55 +08:00
hiyouga
921ee82267 fix chatglm3 template 2024-03-07 14:26:16 +08:00
hiyouga
08d7dc06f2 Update wechat.jpg 2024-03-07 13:14:10 +08:00
cx2333
a8889498fa fix flash_attn in train_web 2024-03-07 10:13:55 +08:00
hiyouga
0048a2021e tiny fix 2024-03-06 17:25:08 +08:00
hiyouga
3e84f430b1 export use balanced gpu 2024-03-06 16:33:14 +08:00
hiyouga
9658c63cd9 fix add tokens 2024-03-06 15:04:02 +08:00
hiyouga
3016e65657 fix version checking 2024-03-06 14:51:51 +08:00
hiyouga
d1587c80de update examples 2024-03-06 13:14:57 +08:00
hiyouga
e0c47358f9 fix arg dtype 2024-03-05 20:53:30 +08:00
hiyouga
259af60d28 improve aqlm optim 2024-03-05 20:49:50 +08:00
hiyouga
d3d3dac707 optimize aqlm training 2024-03-05 18:35:41 +08:00
hiyouga
ddf352f861 fix dora inference 2024-03-05 11:51:41 +08:00
hiyouga
e5edcf440f fix export model 2024-03-05 11:05:41 +08:00
hiyouga
df9e6bb063 update readme 2024-03-05 03:20:23 +08:00
hiyouga
76f31b18eb add examples 2024-03-05 03:16:35 +08:00
hiyouga
9e56eaf2d3 auto set chat template 2024-03-05 02:41:20 +08:00
hiyouga
24a79bd50f update readme 2024-03-04 19:29:26 +08:00
hiyouga
cda2ff8727 fix export on cpu device 2024-03-04 17:35:09 +08:00
hiyouga
9c10854b46 fix sub-process error in thread 2024-03-03 15:04:35 +08:00
hiyouga
7c227e07dd update readme 2024-03-03 01:41:07 +08:00
hiyouga
894d183214 update readme, add starcoder2, cosmopedia 2024-03-03 01:01:46 +08:00
hoshi-hiyouga
1006f372ae Update README_zh.md 2024-03-03 00:49:08 +08:00
hoshi-hiyouga
4bf7eb72e0 Update README.md 2024-03-03 00:48:47 +08:00
hoshi-hiyouga
585c884ea9 Update README.md 2024-03-03 00:48:06 +08:00
hiyouga
318315c76d add colab demo 2024-03-02 19:58:21 +08:00
hiyouga
32884523c5 update data 2024-03-02 19:37:18 +08:00
hiyouga
a736b349f0 move git files 2024-03-02 18:30:11 +08:00
hiyouga
46a06e2362 Update wechat.jpg 2024-03-02 17:48:16 +08:00
hiyouga
4e5fae2fac fix #2649 2024-03-01 13:02:41 +08:00
hiyouga
396fd47947 tiny fix 2024-02-29 21:03:48 +08:00
hiyouga
1bfa70ce8e fix webui 2024-02-29 20:09:09 +08:00
hiyouga
c0be617195 fix #2642 2024-02-29 18:32:54 +08:00
hiyouga
bb16502c33 add twitter 2024-02-29 17:45:30 +08:00
hiyouga
4a871e80e2 tiny fix 2024-02-29 17:28:50 +08:00
hiyouga
ece3b3737e tiny fix and release 2024-02-29 00:46:47 +08:00
hoshi-hiyouga
7c87532476 Merge pull request #2575 from lungothrin/feature/chatter-with-role
support on fly test of tools
2024-02-29 00:39:47 +08:00
hiyouga
4cc2781efe fix #2629 2024-02-29 00:37:29 +08:00