Commit Graph

130 Commits

Author SHA1 Message Date
hiyouga
9a6b694e12 fix #1696
Former-commit-id: bf6f6aeefe
2023-12-01 15:34:50 +08:00
hiyouga
1c43fb6a41 add models
Former-commit-id: 509abe8864
2023-11-30 19:16:13 +08:00
hiyouga
ae1048db6d fix #1659
Former-commit-id: 475a3fa0f4
2023-11-28 20:52:28 +08:00
hiyouga
5f2943dc84 support Yi-34B-Chat models
Former-commit-id: ff1c289229
2023-11-23 19:31:49 +08:00
hiyouga
f06c4c8f7a update ppo trainer
Former-commit-id: 5021062493
2023-11-20 21:39:15 +08:00
hiyouga
48d6d925f7 fix #1558
Former-commit-id: 1740131d63
2023-11-19 14:15:47 +08:00
hiyouga
d3c4881ccb fix packages
Former-commit-id: 999bc0ed93
2023-11-17 16:11:48 +08:00
Shaowen Wang
4ea3144554 Fix: Change rouge-chinese package name to rouge_chinese
To reproduce:
python:
importlib.util.find_spec('rouge-chinese') -> None
importlib.util.find_spec('rouge_chinese') -> ModuleSpec(name='rouge_chinese'...)
from rouge_chinese import Rouge
print(Rouge.__module__) -> rouge_chinese
Former-commit-id: 397e948984
2023-11-16 20:12:35 -06:00
hiyouga
3f53155a90 fix bug in web ui
Former-commit-id: 6efa38be46
2023-11-16 15:21:24 +08:00
hiyouga
e4f97615f0 update ppo and demo in webui
Former-commit-id: 7537dd434f
2023-11-16 14:55:26 +08:00
hiyouga
627212e48b tiny fix
Former-commit-id: 83cee2a604
2023-11-16 03:27:19 +08:00
hiyouga
678052a7ef fix rlhf callback
Former-commit-id: 1817ffc86f
2023-11-16 03:26:19 +08:00
hiyouga
f441932bd1 support full-parameter PPO
Former-commit-id: ce78303600
2023-11-16 02:08:04 +08:00
hiyouga
3e0b76650a update readme and constants
Former-commit-id: 1e19cf242a
2023-11-15 18:04:37 +08:00
hiyouga
06a4820836 disentangle model from tuner and rename modules
Former-commit-id: 4736344eb1
2023-11-15 16:29:09 +08:00
hiyouga
4a767e5593 release v0.2.2, fix #1478 #1466
Former-commit-id: 35cc1e28f6
2023-11-13 23:09:05 +08:00
hiyouga
125587b187 refactor evaluation, upgrade trl to 074
Former-commit-id: 442aefb925
2023-11-13 22:20:35 +08:00
hiyouga
982e0e79c2 fix flashattn warning
Former-commit-id: 4bd8e3906d
2023-11-10 18:34:54 +08:00
hiyouga
0fbaa42752 refactor constants
Former-commit-id: 3697a3dc9a
2023-11-10 14:16:10 +08:00
hiyouga
38755bced7 add template, modify datasets
Former-commit-id: 386f590209
2023-11-09 15:53:23 +08:00
hiyouga
1f2c56bff9 delete file
Former-commit-id: 479d0af2dc
2023-11-07 16:20:12 +08:00
hiyouga
3d40bdb600 upgrade peft, fix #1088 #1411
Former-commit-id: b2a60905f3
2023-11-07 16:13:36 +08:00
hiyouga
a919b6a478 update templates
Former-commit-id: a7eeb8e17c
2023-11-06 12:25:47 +08:00
hiyouga
034b658348 fix deepseek template
Former-commit-id: d08f5e8a14
2023-11-05 13:08:46 +08:00
hiyouga
04107b7af6 support deepseek coder #1378
Former-commit-id: 2a8a258195
2023-11-05 12:51:03 +08:00
hiyouga
6493f6d2e9 fix #1316
Former-commit-id: f4e4a04529
2023-10-31 11:32:08 +08:00
hiyouga
d48478ef88 update constants
Former-commit-id: f28a034a9b
2023-10-29 13:30:20 +08:00
hiyouga
bf0faf129d fix vicuna template
Former-commit-id: 52fc24d166
2023-10-27 22:15:25 +08:00
hiyouga
5705c82cd8 fix chatglm3 template
Former-commit-id: 4117f38827
2023-10-27 21:12:06 +08:00
hiyouga
8a76b1e499 support chatglm3
Former-commit-id: 1c0ab9a908
2023-10-27 19:16:28 +08:00
hiyouga
d18c708f14 fix openchat template
Former-commit-id: 8fdff07e1f
2023-10-21 01:25:42 +08:00
hiyouga
95697652f1 fix #1232
Former-commit-id: b665e9e133
2023-10-20 23:28:52 +08:00
hiyouga
0503d45782 fix eval resuming in webui
Former-commit-id: 273745f9b9
2023-10-15 15:45:38 +08:00
hiyouga
99592478c9 tiny fix
Former-commit-id: 3ad8c92eca
2023-10-15 05:02:48 +08:00
hiyouga
4f9ca28e11 fix callback
Former-commit-id: 1e9401744c
2023-10-15 04:59:44 +08:00
hiyouga
3ae6229140 implement webui resuming training
Former-commit-id: accde3cd39
2023-10-15 04:52:19 +08:00
hiyouga
c9d1cd108d refactor model_dtype, fix PPO trainer
Former-commit-id: 2818af0b09
2023-10-11 23:16:01 +08:00
hiyouga
141937ead6 fix aquila template, repair sft packing mechanism
Former-commit-id: be420e4179
2023-10-10 18:49:55 +08:00
hiyouga
180fd06e61 fix flash shift short attention
Former-commit-id: 0a356bc897
2023-10-09 17:54:48 +08:00
hiyouga
b6e81a0307 fix shift short attention
Former-commit-id: ab65c3063b
2023-10-09 17:07:46 +08:00
hiyouga
d338ab3e19 fix #1068 #1074
Former-commit-id: d11a545463
2023-09-28 14:39:16 +08:00
hiyouga
f61a000e73 tiny fix
Former-commit-id: 5d4118b096
2023-09-28 01:03:04 +08:00
hiyouga
8a8ba08bf7 tiny fix
Former-commit-id: d2ebd225db
2023-09-28 01:02:11 +08:00
hiyouga
755e3e49b4 fix #1064
Former-commit-id: c902236397
2023-09-28 00:53:29 +08:00
hiyouga
deb17942ab fix layer norm dtype
Former-commit-id: 84b7486885
2023-09-28 00:25:55 +08:00
hiyouga
108c31e1fc support LongLoRA
Former-commit-id: 90375f600d
2023-09-27 21:55:50 +08:00
hiyouga
5ee1bdecdc add MMLU and C-Eval script
Former-commit-id: 465ee8119a
2023-09-23 00:34:17 +08:00
hiyouga
48e7b600a8 fix error info
Former-commit-id: 7e8655c8b5
2023-09-19 18:30:23 +08:00
hiyouga
4e86462bad fix #762 #814
Former-commit-id: d4be857e23
2023-09-12 16:10:10 +08:00
hiyouga
8ac7ec0b48 tiny fix
Former-commit-id: 3b306478d4
2023-09-11 18:27:08 +08:00