Commit Graph

587 Commits

Author SHA1 Message Date
hiyouga
a38dbf55e3 fix #1682 2023-11-30 20:03:32 +08:00
hiyouga
509abe8864 add models 2023-11-30 19:16:13 +08:00
yuze.zyz
fb2204c183 fix 2023-11-29 21:43:58 +08:00
yuze.zyz
d38a2e7341 support ms 2023-11-29 20:36:55 +08:00
hiyouga
77d1b14fc2 fix #1658 2023-11-28 20:57:24 +08:00
hiyouga
475a3fa0f4 fix #1659 2023-11-28 20:52:28 +08:00
hiyouga
859a6ea942 support export size setting 2023-11-26 18:34:09 +08:00
hiyouga
ff1c289229 support Yi-34B-Chat models 2023-11-23 19:31:49 +08:00
hiyouga
35c2da3eba set version 2023-11-20 22:57:44 +08:00
hiyouga
9ea9380145 support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569 2023-11-20 22:52:11 +08:00
hiyouga
5021062493 update ppo trainer 2023-11-20 21:39:15 +08:00
hoshi-hiyouga
48211e3799 Merge pull request #1553 from hannlp/hans
Change the default argument settings for PPO training
2023-11-20 20:32:55 +08:00
hiyouga
2a36fd5064 fix value head model resuming 2023-11-20 19:01:37 +08:00
hiyouga
99a3f06377 fix #1567 2023-11-20 18:46:36 +08:00
hiyouga
00baaa990e better data streaming 2023-11-19 23:32:47 +08:00
hiyouga
211b2db5a8 fix model card network issue 2023-11-19 23:03:19 +08:00
hiyouga
bfb9433165 fix Mistral template
https://github.com/lm-sys/FastChat/pull/2547
2023-11-19 16:29:30 +08:00
hiyouga
065bfaeed4 fix #1263 2023-11-19 16:05:18 +08:00
hiyouga
1740131d63 fix #1558 2023-11-19 14:15:47 +08:00
hiyouga
ff6056405d fix evaluator and cached_file in 4.31.0 2023-11-18 19:39:23 +08:00
hiyouga
ccb0f58e22 fix quantization 2023-11-17 22:21:29 +08:00
hiyouga
1bbc1be95e fix #1550 2023-11-17 17:23:13 +08:00
Yuchen Han
eeb5249d0b Update workflow.py 2023-11-17 00:16:27 -08:00
Yuchen Han
b24635d22b Update finetuning_args.py 2023-11-17 00:15:51 -08:00
hiyouga
999bc0ed93 fix packages 2023-11-17 16:11:48 +08:00
Shaowen Wang
397e948984 Fix: Change rouge-chinese package name to rouge_chinese
To reproduce:
python:
importlib.util.find_spec('rouge-chinese') -> None
importlib.util.find_spec('rouge_chinese') -> ModuleSpec(name='rouge_chinese'...)
from rouge_chinese import Rouge
print(Rouge.__module__) -> rouge_chinese
2023-11-16 20:12:35 -06:00
hiyouga
ed9f7705ef fix chatglm template 2023-11-16 22:54:15 +08:00
hiyouga
10ce87e088 fix web ui demo 2023-11-16 18:41:55 +08:00
hiyouga
1c80e9a09e fix web ui demo 2023-11-16 17:12:23 +08:00
hiyouga
c4facc03af release v0.3.0 2023-11-16 16:00:11 +08:00
hiyouga
08f3c11429 fix css 2023-11-16 15:45:38 +08:00
hiyouga
6efa38be46 fix bug in web ui 2023-11-16 15:21:24 +08:00
hiyouga
7537dd434f update ppo and demo in webui 2023-11-16 14:55:26 +08:00
hiyouga
ff52b1779c fix bug in freeze tuning 2023-11-16 14:25:11 +08:00
hiyouga
83cee2a604 tiny fix 2023-11-16 03:27:19 +08:00
hiyouga
1817ffc86f fix rlhf callback 2023-11-16 03:26:19 +08:00
hiyouga
856522a3df fix bug in PPO training 2023-11-16 02:32:54 +08:00
hiyouga
35b91ea34c fix import bug 2023-11-16 02:27:03 +08:00
hiyouga
ce78303600 support full-parameter PPO 2023-11-16 02:08:04 +08:00
hiyouga
8350bcf85d add demo mode for web UI 2023-11-15 23:51:26 +08:00
hiyouga
1e19cf242a update readme and constants 2023-11-15 18:04:37 +08:00
hiyouga
4907452d95 support multiple modules in freeze training #1514 2023-11-15 17:08:18 +08:00
hiyouga
bbbce1f516 fix imports 2023-11-15 16:47:45 +08:00
hiyouga
4736344eb1 disentangle model from tuner and rename modules 2023-11-15 16:29:09 +08:00
hiyouga
2f02f688e1 fix #1507 2023-11-15 16:22:32 +08:00
hiyouga
42c8fc4fb9 add cal_lr.py 2023-11-14 20:58:37 +08:00
hiyouga
d125ef5535 fix #1494 2023-11-14 18:07:20 +08:00
hiyouga
3743b7420b fix #1489 2023-11-14 15:27:05 +08:00
hiyouga
2d42be32c1 support eval remote dataset 2023-11-14 02:42:30 +08:00
hiyouga
35cc1e28f6 release v0.2.2, fix #1478 #1466 2023-11-13 23:09:05 +08:00