Commit Graph

501 Commits

Author SHA1 Message Date
hiyouga
c52336d144 fix reward model loading 2023-11-07 17:20:51 +08:00
hiyouga
d92f112951 fix args 2023-11-07 16:36:06 +08:00
hiyouga
17c64a0579 update info 2023-11-07 16:28:21 +08:00
hiyouga
479d0af2dc delete file 2023-11-07 16:20:12 +08:00
hiyouga
7ebd63a609 fix #1418 2023-11-07 16:17:22 +08:00
hiyouga
b2a60905f3 upgrade peft, fix #1088 #1411 2023-11-07 16:13:36 +08:00
hiyouga
66a91e1fe3 update requirements 2023-11-06 19:01:21 +08:00
hiyouga
de95b69282 use seed in evaluate.py 2023-11-06 18:17:51 +08:00
hiyouga
e1e04cb1f1 update readme (list in alphabetical order) 2023-11-06 17:18:12 +08:00
hiyouga
a7eeb8e17c update templates 2023-11-06 12:25:47 +08:00
hiyouga
2e77a5718a fix #1383 2023-11-06 11:42:23 +08:00
hiyouga
d08f5e8a14 fix deepseek template 2023-11-05 13:08:46 +08:00
hiyouga
2a8a258195 support deepseek coder #1378 2023-11-05 12:51:03 +08:00
hiyouga
63ff909310 fix #1365 2023-11-05 12:21:07 +08:00
hiyouga
5227e18c44 Update wechat.jpg 2023-11-05 10:25:59 +08:00
hiyouga
05d9fc7eff tiny fix 2023-11-03 01:26:06 +08:00
hiyouga
eb9d9e104a fix #1290 2023-11-03 00:44:53 +08:00
hiyouga
b355f6cac9 fix bug in data loader, support dpo eval 2023-11-03 00:34:26 +08:00
hiyouga
2b5e33c338 update data readme 2023-11-03 00:15:23 +08:00
hiyouga
cc8ffa10d8 update data readme (zh) 2023-11-02 23:42:49 +08:00
hiyouga
a837172413 support sharegpt format, add datasets 2023-11-02 23:10:04 +08:00
hiyouga
c1edb0cf1b support pagination in webui preview 2023-11-02 21:21:45 +08:00
hiyouga
34d8b2e56c fix webui 2023-11-02 18:03:14 +08:00
hiyouga
9cde5e8af6 support warning in webui 2023-11-02 17:57:04 +08:00
hiyouga
f8703aac08 fix #1349 2023-11-02 17:02:44 +08:00
hiyouga
dff128c7e3 fix #1356 2023-11-02 16:51:52 +08:00
hiyouga
083787dbfe fix #1325 2023-11-01 23:38:49 +08:00
hiyouga
8b912690e3 fix chat 2023-11-01 23:07:58 +08:00
hiyouga
84af10cec9 update gradio, support multiple resp in api 2023-11-01 23:02:16 +08:00
hiyouga
d8cf8cfdeb fix SFT trainer 2023-10-31 21:52:52 +08:00
hiyouga
f4e4a04529 fix #1316 2023-10-31 11:32:08 +08:00
hiyouga
9093cb1a2e Update wechat.jpg 2023-10-30 14:01:08 +08:00
hiyouga
640a520108 update projects 2023-10-29 22:53:47 +08:00
hiyouga
59f342e76f add projects 2023-10-29 22:07:13 +08:00
hiyouga
f28a034a9b update constants 2023-10-29 13:30:20 +08:00
hiyouga
52fc24d166 fix vicuna template 2023-10-27 22:15:25 +08:00
hiyouga
4117f38827 fix chatglm3 template 2023-10-27 21:12:06 +08:00
hiyouga
4600c29e93 update readme 2023-10-27 19:19:03 +08:00
hiyouga
1c0ab9a908 support chatglm3 2023-10-27 19:16:28 +08:00
hiyouga
3fe7df628d support dataset cache 2023-10-26 21:48:45 +08:00
hiyouga
838ed9aa87 fix #1287 2023-10-26 17:49:41 +08:00
hiyouga
aff9363ce3 fix #1285 2023-10-26 16:34:52 +08:00
hiyouga
d357e08b58 Update wechat.jpg 2023-10-24 16:02:12 +08:00
hiyouga
2caf91f824 remove filter in preprocess 2023-10-23 23:46:02 +08:00
hiyouga
7de7174ce3 update neftune logic 2023-10-22 17:42:13 +08:00
hiyouga
11b55a3270 fix webui 2023-10-22 17:24:56 +08:00
hiyouga
f793ca0a2c add new options in webui 2023-10-22 17:17:58 +08:00
hiyouga
b79ca8781e fix recursion error 2023-10-22 16:28:37 +08:00
hiyouga
7b4acf7265 reimplement neftune 2023-10-22 16:15:08 +08:00
hoshi-hiyouga
b42a145253 Merge pull request #1252 from anvie/neftune
add NEFTune optimization
2023-10-22 15:59:20 +08:00