Commit Graph

46 Commits

Author SHA1 Message Date
hiyouga
c7ea17d616 add yuan model 2023-12-29 13:50:24 +08:00
hiyouga
2df923540c add xverse-65B-2 model 2023-12-18 19:24:09 +08:00
hiyouga
709ac8870a add models 2023-12-18 19:09:31 +08:00
hiyouga
7ae6919b9b add xverse-65b-chat model 2023-12-16 20:21:29 +08:00
hiyouga
71389be37c support autogptq in llama board #246 2023-12-16 16:31:30 +08:00
hiyouga
3524aa1e58 support quantization in export model 2023-12-15 23:44:50 +08:00
hiyouga
3552035d7e add model urls 2023-12-13 00:09:17 +08:00
hiyouga
96380f5e18 support mixtral 2023-12-12 11:39:04 +08:00
hiyouga
e25f7bae16 add models 2023-12-06 13:33:18 +08:00
hiyouga
6e7af11b98 add xuanyuan models 2023-12-02 00:35:29 +08:00
hiyouga
bd42c229b0 patch modelscope 2023-12-01 22:53:15 +08:00
yuze.zyz
5a2392f105 remove useless code 2023-12-01 17:28:23 +08:00
tastelikefeet
d9e52957e2 fix bug 2023-12-01 17:27:00 +08:00
yuze.zyz
5aa6751e52 add readme 2023-12-01 16:11:30 +08:00
tastelikefeet
8ce4d11e38 add model 2023-12-01 15:06:17 +08:00
yuze.zyz
fb2204c183 fix 2023-11-29 21:43:58 +08:00
yuze.zyz
d38a2e7341 support ms 2023-11-29 20:36:55 +08:00
hiyouga
475a3fa0f4 fix #1659 2023-11-28 20:52:28 +08:00
hiyouga
ff1c289229 support Yi-34B-Chat models 2023-11-23 19:31:49 +08:00
hiyouga
7537dd434f update ppo and demo in webui 2023-11-16 14:55:26 +08:00
hiyouga
1e19cf242a update readme and constants 2023-11-15 18:04:37 +08:00
hiyouga
4736344eb1 disentangle model from tuner and rename modules 2023-11-15 16:29:09 +08:00
hiyouga
35cc1e28f6 release v0.2.2, fix #1478 #1466 2023-11-13 23:09:05 +08:00
hiyouga
3697a3dc9a refactor constants 2023-11-10 14:16:10 +08:00
hiyouga
b2a60905f3 upgrade peft, fix #1088 #1411 2023-11-07 16:13:36 +08:00
hiyouga
f28a034a9b update constants 2023-10-29 13:30:20 +08:00
hiyouga
d11a545463 fix #1068 #1074 2023-09-28 14:39:16 +08:00
hiyouga
84b7486885 fix layer norm dtype 2023-09-28 00:25:55 +08:00
hiyouga
465ee8119a add MMLU and C-Eval script 2023-09-23 00:34:17 +08:00
hiyouga
b218c271ed remove PeftTrainer 2023-09-10 22:23:23 +08:00
hiyouga
0531886e1f update baichuan2 template 2023-09-06 21:43:06 +08:00
hiyouga
62ce65c628 add Baichuan2 models 2023-09-06 18:36:04 +08:00
hiyouga
a9d1fb72f7 refactor dataset_attr, add eos in pt, fix #757 2023-09-01 19:00:45 +08:00
codemayq
c0e4d1e81b add dataset stage and filter dataset when stage chosen in webui 2023-08-23 18:54:23 +08:00
hiyouga
9d0f6214b6 update webui 2023-08-14 22:45:26 +08:00
codemayq
79c68e5527 add template match and stage in webui 2023-08-14 20:42:59 +08:00
hiyouga
fa940c17b8 support rope scaling, fix #475 #476 #478 2023-08-12 20:46:27 +08:00
codemayq
6bc8e9866d add sft script preview in webui 2023-08-12 13:53:55 +08:00
hiyouga
a48cb0d474 Release v0.1.6 2023-08-11 23:25:57 +08:00
hiyouga
3ec4351cfd support DPO training (2305.18290) 2023-08-11 03:02:53 +08:00
hiyouga
69744c17e8 support interleave probs 2023-08-04 21:27:35 +08:00
hiyouga
4d1641c1bf update UI, fix #212 2023-07-20 22:09:06 +08:00
hiyouga
7a3ade8c69 support LLaMA-2 2023-07-19 16:42:14 +08:00
hiyouga
262252d67b a monkey patch for lora_target 2023-07-18 00:31:40 +08:00
hiyouga
f8193e8009 release v0.1.0 2023-07-18 00:18:25 +08:00
hiyouga
f751376613 modity code structure 2023-07-15 16:54:28 +08:00