hiyouga
|
c7ea17d616
|
add yuan model
|
2023-12-29 13:50:24 +08:00 |
|
hiyouga
|
2df923540c
|
add xverse-65B-2 model
|
2023-12-18 19:24:09 +08:00 |
|
hiyouga
|
709ac8870a
|
add models
|
2023-12-18 19:09:31 +08:00 |
|
hiyouga
|
7ae6919b9b
|
add xverse-65b-chat model
|
2023-12-16 20:21:29 +08:00 |
|
hiyouga
|
71389be37c
|
support autogptq in llama board #246
|
2023-12-16 16:31:30 +08:00 |
|
hiyouga
|
3524aa1e58
|
support quantization in export model
|
2023-12-15 23:44:50 +08:00 |
|
hiyouga
|
3552035d7e
|
add model urls
|
2023-12-13 00:09:17 +08:00 |
|
hiyouga
|
96380f5e18
|
support mixtral
|
2023-12-12 11:39:04 +08:00 |
|
hiyouga
|
e25f7bae16
|
add models
|
2023-12-06 13:33:18 +08:00 |
|
hiyouga
|
6e7af11b98
|
add xuanyuan models
|
2023-12-02 00:35:29 +08:00 |
|
hiyouga
|
bd42c229b0
|
patch modelscope
|
2023-12-01 22:53:15 +08:00 |
|
yuze.zyz
|
5a2392f105
|
remove useless code
|
2023-12-01 17:28:23 +08:00 |
|
tastelikefeet
|
d9e52957e2
|
fix bug
|
2023-12-01 17:27:00 +08:00 |
|
yuze.zyz
|
5aa6751e52
|
add readme
|
2023-12-01 16:11:30 +08:00 |
|
tastelikefeet
|
8ce4d11e38
|
add model
|
2023-12-01 15:06:17 +08:00 |
|
yuze.zyz
|
fb2204c183
|
fix
|
2023-11-29 21:43:58 +08:00 |
|
yuze.zyz
|
d38a2e7341
|
support ms
|
2023-11-29 20:36:55 +08:00 |
|
hiyouga
|
475a3fa0f4
|
fix #1659
|
2023-11-28 20:52:28 +08:00 |
|
hiyouga
|
ff1c289229
|
support Yi-34B-Chat models
|
2023-11-23 19:31:49 +08:00 |
|
hiyouga
|
7537dd434f
|
update ppo and demo in webui
|
2023-11-16 14:55:26 +08:00 |
|
hiyouga
|
1e19cf242a
|
update readme and constants
|
2023-11-15 18:04:37 +08:00 |
|
hiyouga
|
4736344eb1
|
disentangle model from tuner and rename modules
|
2023-11-15 16:29:09 +08:00 |
|
hiyouga
|
35cc1e28f6
|
release v0.2.2, fix #1478 #1466
|
2023-11-13 23:09:05 +08:00 |
|
hiyouga
|
3697a3dc9a
|
refactor constants
|
2023-11-10 14:16:10 +08:00 |
|
hiyouga
|
b2a60905f3
|
upgrade peft, fix #1088 #1411
|
2023-11-07 16:13:36 +08:00 |
|
hiyouga
|
f28a034a9b
|
update constants
|
2023-10-29 13:30:20 +08:00 |
|
hiyouga
|
d11a545463
|
fix #1068 #1074
|
2023-09-28 14:39:16 +08:00 |
|
hiyouga
|
84b7486885
|
fix layer norm dtype
|
2023-09-28 00:25:55 +08:00 |
|
hiyouga
|
465ee8119a
|
add MMLU and C-Eval script
|
2023-09-23 00:34:17 +08:00 |
|
hiyouga
|
b218c271ed
|
remove PeftTrainer
|
2023-09-10 22:23:23 +08:00 |
|
hiyouga
|
0531886e1f
|
update baichuan2 template
|
2023-09-06 21:43:06 +08:00 |
|
hiyouga
|
62ce65c628
|
add Baichuan2 models
|
2023-09-06 18:36:04 +08:00 |
|
hiyouga
|
a9d1fb72f7
|
refactor dataset_attr, add eos in pt, fix #757
|
2023-09-01 19:00:45 +08:00 |
|
codemayq
|
c0e4d1e81b
|
add dataset stage and filter dataset when stage chosen in webui
|
2023-08-23 18:54:23 +08:00 |
|
hiyouga
|
9d0f6214b6
|
update webui
|
2023-08-14 22:45:26 +08:00 |
|
codemayq
|
79c68e5527
|
add template match and stage in webui
|
2023-08-14 20:42:59 +08:00 |
|
hiyouga
|
fa940c17b8
|
support rope scaling, fix #475 #476 #478
|
2023-08-12 20:46:27 +08:00 |
|
codemayq
|
6bc8e9866d
|
add sft script preview in webui
|
2023-08-12 13:53:55 +08:00 |
|
hiyouga
|
a48cb0d474
|
Release v0.1.6
|
2023-08-11 23:25:57 +08:00 |
|
hiyouga
|
3ec4351cfd
|
support DPO training (2305.18290)
|
2023-08-11 03:02:53 +08:00 |
|
hiyouga
|
69744c17e8
|
support interleave probs
|
2023-08-04 21:27:35 +08:00 |
|
hiyouga
|
4d1641c1bf
|
update UI, fix #212
|
2023-07-20 22:09:06 +08:00 |
|
hiyouga
|
7a3ade8c69
|
support LLaMA-2
|
2023-07-19 16:42:14 +08:00 |
|
hiyouga
|
262252d67b
|
a monkey patch for lora_target
|
2023-07-18 00:31:40 +08:00 |
|
hiyouga
|
f8193e8009
|
release v0.1.0
|
2023-07-18 00:18:25 +08:00 |
|
hiyouga
|
f751376613
|
modity code structure
|
2023-07-15 16:54:28 +08:00 |
|