hiyouga
|
3ba1054593
|
update readme
|
2024-02-26 17:25:47 +08:00 |
|
Rayrtfr
|
6e0fba60b3
|
Support Atom Model
|
2024-02-26 10:44:10 +08:00 |
|
hiyouga
|
c99e19641a
|
support gemma
|
2024-02-21 23:27:36 +08:00 |
|
hiyouga
|
88a1bc9773
|
lint
|
2024-02-07 01:10:04 +08:00 |
|
hiyouga
|
85622ae757
|
add models
|
2024-02-06 14:57:23 +08:00 |
|
hiyouga
|
ccabb5b04a
|
support qwen1.5
|
2024-02-06 00:10:51 +08:00 |
|
hiyouga
|
6fc2d5cc03
|
add orion models
|
2024-01-22 21:26:53 +08:00 |
|
hiyouga
|
638234ceee
|
format style
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
5608a0da8e
|
update readme
|
2024-01-18 14:30:48 +08:00 |
|
hiyouga
|
d9f1cae351
|
support function calling
|
2024-01-18 09:54:23 +08:00 |
|
hiyouga
|
5a207bb723
|
tiny fix
|
2024-01-15 23:34:23 +08:00 |
|
hiyouga
|
bf73224f33
|
support solar 10.7B #1907
|
2024-01-14 00:30:30 +08:00 |
|
hiyouga
|
ca3933dc52
|
support deepseek moe
|
2024-01-14 00:14:49 +08:00 |
|
hiyouga
|
d1a73fe26c
|
fix phi modules
|
2024-01-13 23:12:47 +08:00 |
|
hiyouga
|
919acc2b0b
|
modify weight name
|
2024-01-09 20:22:47 +08:00 |
|
hiyouga
|
4571068e1e
|
fix #1789
|
2024-01-09 18:31:27 +08:00 |
|
hiyouga
|
c7ea17d616
|
add yuan model
|
2023-12-29 13:50:24 +08:00 |
|
hiyouga
|
2df923540c
|
add xverse-65B-2 model
|
2023-12-18 19:24:09 +08:00 |
|
hiyouga
|
709ac8870a
|
add models
|
2023-12-18 19:09:31 +08:00 |
|
hiyouga
|
7ae6919b9b
|
add xverse-65b-chat model
|
2023-12-16 20:21:29 +08:00 |
|
hiyouga
|
71389be37c
|
support autogptq in llama board #246
|
2023-12-16 16:31:30 +08:00 |
|
hiyouga
|
3524aa1e58
|
support quantization in export model
|
2023-12-15 23:44:50 +08:00 |
|
hiyouga
|
3552035d7e
|
add model urls
|
2023-12-13 00:09:17 +08:00 |
|
hiyouga
|
96380f5e18
|
support mixtral
|
2023-12-12 11:39:04 +08:00 |
|
hiyouga
|
e25f7bae16
|
add models
|
2023-12-06 13:33:18 +08:00 |
|
hiyouga
|
6e7af11b98
|
add xuanyuan models
|
2023-12-02 00:35:29 +08:00 |
|
hiyouga
|
bd42c229b0
|
patch modelscope
|
2023-12-01 22:53:15 +08:00 |
|
yuze.zyz
|
5a2392f105
|
remove useless code
|
2023-12-01 17:28:23 +08:00 |
|
tastelikefeet
|
d9e52957e2
|
fix bug
|
2023-12-01 17:27:00 +08:00 |
|
yuze.zyz
|
5aa6751e52
|
add readme
|
2023-12-01 16:11:30 +08:00 |
|
tastelikefeet
|
8ce4d11e38
|
add model
|
2023-12-01 15:06:17 +08:00 |
|
yuze.zyz
|
fb2204c183
|
fix
|
2023-11-29 21:43:58 +08:00 |
|
yuze.zyz
|
d38a2e7341
|
support ms
|
2023-11-29 20:36:55 +08:00 |
|
hiyouga
|
475a3fa0f4
|
fix #1659
|
2023-11-28 20:52:28 +08:00 |
|
hiyouga
|
ff1c289229
|
support Yi-34B-Chat models
|
2023-11-23 19:31:49 +08:00 |
|
hiyouga
|
7537dd434f
|
update ppo and demo in webui
|
2023-11-16 14:55:26 +08:00 |
|
hiyouga
|
1e19cf242a
|
update readme and constants
|
2023-11-15 18:04:37 +08:00 |
|
hiyouga
|
4736344eb1
|
disentangle model from tuner and rename modules
|
2023-11-15 16:29:09 +08:00 |
|
hiyouga
|
35cc1e28f6
|
release v0.2.2, fix #1478 #1466
|
2023-11-13 23:09:05 +08:00 |
|
hiyouga
|
3697a3dc9a
|
refactor constants
|
2023-11-10 14:16:10 +08:00 |
|
hiyouga
|
b2a60905f3
|
upgrade peft, fix #1088 #1411
|
2023-11-07 16:13:36 +08:00 |
|
hiyouga
|
f28a034a9b
|
update constants
|
2023-10-29 13:30:20 +08:00 |
|
hiyouga
|
d11a545463
|
fix #1068 #1074
|
2023-09-28 14:39:16 +08:00 |
|
hiyouga
|
84b7486885
|
fix layer norm dtype
|
2023-09-28 00:25:55 +08:00 |
|
hiyouga
|
465ee8119a
|
add MMLU and C-Eval script
|
2023-09-23 00:34:17 +08:00 |
|
hiyouga
|
b218c271ed
|
remove PeftTrainer
|
2023-09-10 22:23:23 +08:00 |
|
hiyouga
|
0531886e1f
|
update baichuan2 template
|
2023-09-06 21:43:06 +08:00 |
|
hiyouga
|
62ce65c628
|
add Baichuan2 models
|
2023-09-06 18:36:04 +08:00 |
|
hiyouga
|
a9d1fb72f7
|
refactor dataset_attr, add eos in pt, fix #757
|
2023-09-01 19:00:45 +08:00 |
|
codemayq
|
c0e4d1e81b
|
add dataset stage and filter dataset when stage chosen in webui
|
2023-08-23 18:54:23 +08:00 |
|