| Author | Commit | Message | Date |
|---|---|---|---|
| hoshi-hiyouga | 2185855bdb | Merge pull request #2730 from cx2333-gt/main: fix flash_attn in train_web | 2024-03-07 14:37:18 +08:00 |
| cx2333 | 94b7a1b915 | revert choice name | 2024-03-07 14:28:55 +08:00 |
| hiyouga | 921ee82267 | fix chatglm3 template | 2024-03-07 14:26:16 +08:00 |
| cx2333 | a8889498fa | fix flash_attn in train_web | 2024-03-07 10:13:55 +08:00 |
| hiyouga | 0048a2021e | tiny fix | 2024-03-06 17:25:08 +08:00 |
| hiyouga | 3e84f430b1 | export use balanced gpu | 2024-03-06 16:33:14 +08:00 |
| hiyouga | 9658c63cd9 | fix add tokens | 2024-03-06 15:04:02 +08:00 |
| hiyouga | 3016e65657 | fix version checking | 2024-03-06 14:51:51 +08:00 |
| hiyouga | e0c47358f9 | fix arg dtype | 2024-03-05 20:53:30 +08:00 |
| hiyouga | 259af60d28 | improve aqlm optim | 2024-03-05 20:49:50 +08:00 |
| hiyouga | d3d3dac707 | optimize aqlm training | 2024-03-05 18:35:41 +08:00 |
| hiyouga | ddf352f861 | fix dora inference | 2024-03-05 11:51:41 +08:00 |
| hiyouga | e5edcf440f | fix export model | 2024-03-05 11:05:41 +08:00 |
| hiyouga | 9e56eaf2d3 | auto set chat template | 2024-03-05 02:41:20 +08:00 |
| hiyouga | 24a79bd50f | update readme | 2024-03-04 19:29:26 +08:00 |
| hiyouga | cda2ff8727 | fix export on cpu device | 2024-03-04 17:35:09 +08:00 |
| hiyouga | 9c10854b46 | fix sub-process error in thread | 2024-03-03 15:04:35 +08:00 |
| hiyouga | 894d183214 | update readme, add starcoder2, cosmopedia | 2024-03-03 01:01:46 +08:00 |
| hiyouga | 4e5fae2fac | fix #2649 | 2024-03-01 13:02:41 +08:00 |
| hiyouga | 396fd47947 | tiny fix | 2024-02-29 21:03:48 +08:00 |
| hiyouga | 1bfa70ce8e | fix webui | 2024-02-29 20:09:09 +08:00 |
| hiyouga | c0be617195 | fix #2642 | 2024-02-29 18:32:54 +08:00 |
| hiyouga | 4a871e80e2 | tiny fix | 2024-02-29 17:28:50 +08:00 |
| hiyouga | ece3b3737e | tiny fix and release | 2024-02-29 00:46:47 +08:00 |
| hoshi-hiyouga | 7c87532476 | Merge pull request #2575 from lungothrin/feature/chatter-with-role: support on fly test of tools | 2024-02-29 00:39:47 +08:00 |
| hiyouga | 4cc2781efe | fix #2629 | 2024-02-29 00:37:29 +08:00 |
| hiyouga | fa5ab21ebc | release v0.5.3 | 2024-02-29 00:34:19 +08:00 |
| hiyouga | 38d8b2cef8 | update chatglm3 template | 2024-02-28 21:11:23 +08:00 |
| hiyouga | cfefacaa37 | support DoRA, AWQ, AQLM #2512 | 2024-02-28 19:53:28 +08:00 |
| Liang Ge | 56b3918fda | support on fly test of tools | 2024-02-28 01:17:49 +08:00 |
| hiyouga | 3ba1054593 | update readme | 2024-02-26 17:25:47 +08:00 |
| Rayrtfr | 6e0fba60b3 | Support Atom Model | 2024-02-26 10:44:10 +08:00 |
| hiyouga | 2e592be536 | update webui | 2024-02-25 20:23:41 +08:00 |
| hoshi-hiyouga | 4aab19c7ef | Merge pull request #2525 from stephen-nju/main: update project_kwargs for ppo config | 2024-02-25 15:54:00 +08:00 |
| hiyouga | 354f13c01a | fix data entry | 2024-02-23 18:29:24 +08:00 |
| hiyouga | 6bf4c1274f | fix gemma template | 2024-02-23 13:49:53 +08:00 |
| hiyouga | a87838ded1 | fix template | 2024-02-22 12:09:21 +08:00 |
| hiyouga | c375a20230 | fix template | 2024-02-22 12:06:48 +08:00 |
| hiyouga | c99e19641a | support gemma | 2024-02-21 23:27:36 +08:00 |
| hiyouga | 3cc10a01a7 | fix #2532 | 2024-02-21 21:55:14 +08:00 |
| hiyouga | daa3185350 | tiny fix | 2024-02-21 18:30:29 +08:00 |
| stephen | 42c23798f2 | update project_kwargs for ppo config | 2024-02-21 13:47:38 +08:00 |
| hiyouga | 9aeb404a94 | support lora for llama pro | 2024-02-21 02:17:22 +08:00 |
| hiyouga | 02c8c55ce3 | fix #2516 | 2024-02-20 20:44:24 +08:00 |
| hiyouga | 5bbec1c1f2 | release v0.5.2 | 2024-02-20 11:12:43 +08:00 |
| hiyouga | ba998c67ab | update webui | 2024-02-19 16:49:58 +08:00 |
| hiyouga | 22acab8aff | fix #2481 | 2024-02-15 19:07:47 +08:00 |
| hiyouga | 7924ffc55d | support llama pro #2338 , add rslora | 2024-02-15 02:27:36 +08:00 |
| younesbelkada | 0ca0f08162 | add v1 hf tags | 2024-02-13 05:58:49 +00:00 |
| hiyouga | 12b2066e34 | fix #2471 | 2024-02-12 21:07:46 +08:00 |