hiyouga
|
8fdff07e1f
|
fix openchat template
|
2023-10-21 01:25:42 +08:00 |
|
hiyouga
|
b665e9e133
|
fix #1232
|
2023-10-20 23:28:52 +08:00 |
|
hiyouga
|
273745f9b9
|
fix eval resuming in webui
|
2023-10-15 15:45:38 +08:00 |
|
hiyouga
|
3ad8c92eca
|
tiny fix
|
2023-10-15 05:02:48 +08:00 |
|
hiyouga
|
1e9401744c
|
fix callback
|
2023-10-15 04:59:44 +08:00 |
|
hiyouga
|
accde3cd39
|
implement webui resuming training
|
2023-10-15 04:52:19 +08:00 |
|
hiyouga
|
2818af0b09
|
refactor model_dtype, fix PPO trainer
|
2023-10-11 23:16:01 +08:00 |
|
hiyouga
|
be420e4179
|
fix aquila template, repair sft packing mechanism
|
2023-10-10 18:49:55 +08:00 |
|
hiyouga
|
0a356bc897
|
fix flash shift short attention
|
2023-10-09 17:54:48 +08:00 |
|
hiyouga
|
ab65c3063b
|
fix shift short attention
|
2023-10-09 17:07:46 +08:00 |
|
hiyouga
|
d11a545463
|
fix #1068 #1074
|
2023-09-28 14:39:16 +08:00 |
|
hiyouga
|
5d4118b096
|
tiny fix
|
2023-09-28 01:03:04 +08:00 |
|
hiyouga
|
d2ebd225db
|
tiny fix
|
2023-09-28 01:02:11 +08:00 |
|
hiyouga
|
c902236397
|
fix #1064
|
2023-09-28 00:53:29 +08:00 |
|
hiyouga
|
84b7486885
|
fix layer norm dtype
|
2023-09-28 00:25:55 +08:00 |
|
hiyouga
|
90375f600d
|
support LongLoRA
|
2023-09-27 21:55:50 +08:00 |
|
hiyouga
|
465ee8119a
|
add MMLU and C-Eval script
|
2023-09-23 00:34:17 +08:00 |
|
hiyouga
|
7e8655c8b5
|
fix error info
|
2023-09-19 18:30:23 +08:00 |
|
hiyouga
|
d4be857e23
|
fix #762 #814
|
2023-09-12 16:10:10 +08:00 |
|
hiyouga
|
3b306478d4
|
tiny fix
|
2023-09-11 18:27:08 +08:00 |
|
hiyouga
|
0fbece85a7
|
update flashattn, fix ppo save model
|
2023-09-11 17:25:36 +08:00 |
|
hiyouga
|
b218c271ed
|
remove PeftTrainer
|
2023-09-10 22:23:23 +08:00 |
|
hiyouga
|
d8aa1404be
|
support FlashAttention2
|
2023-09-10 20:43:56 +08:00 |
|
hiyouga
|
a51b7c98ac
|
fix lora target
|
2023-09-09 17:04:45 +08:00 |
|
hiyouga
|
bca1a247bc
|
support lora target auto find
|
2023-09-09 15:38:37 +08:00 |
|
hiyouga
|
8ea32e4046
|
change to right-padding, update reward score #803
|
2023-09-08 20:04:31 +08:00 |
|
hiyouga
|
8aaaa132d4
|
fix chatglm template
|
2023-09-08 14:45:58 +08:00 |
|
hiyouga
|
85b1f6632a
|
fix baichuan templates
|
2023-09-07 18:54:14 +08:00 |
|
hiyouga
|
0531886e1f
|
update baichuan2 template
|
2023-09-06 21:43:06 +08:00 |
|
hiyouga
|
62ce65c628
|
add Baichuan2 models
|
2023-09-06 18:36:04 +08:00 |
|
hiyouga
|
a9d1fb72f7
|
refactor dataset_attr, add eos in pt, fix #757
|
2023-09-01 19:00:45 +08:00 |
|
codemayq
|
0bcc489c42
|
update llama2 template
|
2023-08-30 16:23:56 +08:00 |
|
codemayq
|
c0e4d1e81b
|
add dataset stage and filter dataset when stage chosen in webui
|
2023-08-23 18:54:23 +08:00 |
|
hiyouga
|
4318347d3f
|
update template
|
2023-08-22 19:46:09 +08:00 |
|
hiyouga
|
02d69b6fde
|
fix #608
|
2023-08-21 17:49:36 +08:00 |
|
hiyouga
|
0a3f698425
|
fix baichuan template for training #597 #616
|
2023-08-21 17:41:51 +08:00 |
|
hiyouga
|
9f4c2adc9a
|
fix ChatGLM2 ppo #527 #528
|
2023-08-18 00:34:59 +08:00 |
|
hiyouga
|
be21fc83f9
|
fix generation bug #532
|
2023-08-17 22:21:34 +08:00 |
|
hiyouga
|
892fd39373
|
fix baichuan and intern template
|
2023-08-17 01:27:20 +08:00 |
|
hiyouga
|
7407d9daa1
|
fix system prompt
|
2023-08-16 01:35:52 +08:00 |
|
hiyouga
|
273135f595
|
fix baichuan template #481
|
2023-08-15 11:38:21 +08:00 |
|
hiyouga
|
80b4053602
|
alert pad_token source
|
2023-08-15 00:07:56 +08:00 |
|
hiyouga
|
9d0f6214b6
|
update webui
|
2023-08-14 22:45:26 +08:00 |
|
codemayq
|
79c68e5527
|
add template match and stage in webui
|
2023-08-14 20:42:59 +08:00 |
|
hiyouga
|
fa940c17b8
|
support rope scaling, fix #475 #476 #478
|
2023-08-12 20:46:27 +08:00 |
|
codemayq
|
6bc8e9866d
|
add sft script preview in webui
|
2023-08-12 13:53:55 +08:00 |
|
hiyouga
|
dd51c24203
|
fix unusual output of 8bit models #278 #391
|
2023-08-12 00:25:29 +08:00 |
|
hiyouga
|
a48cb0d474
|
Release v0.1.6
|
2023-08-11 23:25:57 +08:00 |
|
hiyouga
|
d3844e97e3
|
add defaults
|
2023-08-11 13:56:26 +08:00 |
|
hiyouga
|
d59f938959
|
fix stop word in baichuan template
|
2023-08-11 13:51:46 +08:00 |
|