Commit Graph

271 Commits

Author SHA1 Message Date
hiyouga
ca08e5efd3 fix webui cache 2023-08-14 11:37:01 +08:00
hiyouga
2391a84e26 update readme_zh 2023-08-14 11:13:25 +08:00
hiyouga
ec94274ca1 web UI integrating RLHF 2023-08-14 10:48:47 +08:00
hiyouga
2f2fd55d81 fix #480 2023-08-14 00:23:56 +08:00
hiyouga
d69b1388e6 fix webui 2023-08-12 23:52:07 +08:00
hiyouga
9dc6a296e3 tiny fix 2023-08-12 22:02:43 +08:00
hiyouga
8545c11c45 fix rope scaling 2023-08-12 22:00:01 +08:00
hiyouga
8a79ded55d update readme 2023-08-12 21:29:06 +08:00
hiyouga
3ea1fa35d1 update readme 2023-08-12 21:25:19 +08:00
hiyouga
2618e0b5a7 update readme 2023-08-12 21:23:05 +08:00
hiyouga
1836c020c5 update readme 2023-08-12 21:00:11 +08:00
hiyouga
fa940c17b8 support rope scaling, fix #475 #476 #478 2023-08-12 20:46:27 +08:00
hoshi-hiyouga
2eb0eca65f Merge pull request #479 from hiyouga/feature-addCmdExport
add sft script preview in webui
2023-08-12 20:41:52 +08:00
codemayq
6bc8e9866d add sft script preview in webui 2023-08-12 13:53:55 +08:00
hiyouga
dd51c24203 fix unusual output of 8bit models #278 #391 2023-08-12 00:25:29 +08:00
hiyouga
a48cb0d474 Release v0.1.6 2023-08-11 23:25:57 +08:00
hiyouga
156710a995 Update README_zh.md 2023-08-11 14:06:02 +08:00
hiyouga
d3844e97e3 add defaults 2023-08-11 13:56:26 +08:00
hiyouga
d59f938959 fix stop word in baichuan template 2023-08-11 13:51:46 +08:00
hiyouga
9c6dd10514 fix baichuan template 2023-08-11 13:45:47 +08:00
hiyouga
3ec4351cfd support DPO training (2305.18290) 2023-08-11 03:02:53 +08:00
hoshi-hiyouga
685dae4eff Merge pull request #451 from jovialchen/main
huggingface login for projects must login while running
2023-08-10 17:25:38 +08:00
hiyouga
ad6e7c76c7 fix webui val size 2023-08-10 15:20:44 +08:00
jiongxuc
3e000c2b60 huggingface login for projects must login while running 2023-08-10 14:57:12 +08:00
hiyouga
eb6e571cb7 fix template 2023-08-09 23:14:27 +08:00
hiyouga
ac29f4d5f0 fix template 2023-08-09 23:10:20 +08:00
hiyouga
d86ea314a1 support val set in streaming mode 2023-08-09 23:00:26 +08:00
hiyouga
572ea3bafb fix tokenizer 2023-08-09 17:52:15 +08:00
hiyouga
ef5b299b18 Update wechat.jpg 2023-08-09 17:36:17 +08:00
hiyouga
df946e6949 fix sft trainer 2023-08-09 16:35:03 +08:00
hiyouga
39cd8b6989 fix rm #420, fix template #426, fix #423 2023-08-09 16:23:31 +08:00
hoshi-hiyouga
2d90685358 fix llama2 template 2023-08-09 00:58:27 +08:00
hoshi-hiyouga
32fa5e8d70 fix tokenizer 2023-08-09 00:54:54 +08:00
hiyouga
3a720aac66 update webui 2023-08-09 00:26:11 +08:00
hiyouga
eecc4b2131 fix tokenizer #417 2023-08-08 23:59:41 +08:00
hiyouga
caa0eda27d fix bug 2023-08-08 21:28:28 +08:00
hiyouga
4b841a6b35 fix bug 2023-08-08 17:55:55 +08:00
hiyouga
a9980617f5 fix chatml template #408 2023-08-08 17:44:39 +08:00
hiyouga
5453b93db0 update args spec 2023-08-07 15:23:35 +08:00
hiyouga
20cf27976f update readme 2023-08-07 15:02:02 +08:00
hiyouga
cacd5b703d Merge branch 'main' of https://github.com/hiyouga/LLaMA-Efficient-Tuning 2023-08-07 13:59:16 +08:00
hiyouga
081345baca fix #376 2023-08-07 13:58:59 +08:00
hoshi-hiyouga
da42d289ee Merge pull request #382 from hiyouga/feature-updateReadme
add detailed model configs
2023-08-07 13:43:38 +08:00
hiyouga
220175ab24 update trainer 2023-08-07 13:34:35 +08:00
codemayq
293bd95712 add detailed model configs 2023-08-07 09:30:23 +08:00
hiyouga
e21ae01356 fix qwen eos token 2023-08-06 13:31:17 +08:00
hiyouga
7f18d2a335 fix qwen tokenizer #361 2023-08-05 17:06:05 +08:00
hiyouga
1afa51c2fa fix template for tiktoken 2023-08-05 13:42:42 +08:00
hiyouga
53d95725c5 remove redundant code 2023-08-05 00:27:27 +08:00
hiyouga
c183b3551d fix template 2023-08-05 00:25:00 +08:00