hiyouga
|
d125ef5535
|
fix #1494
|
2023-11-14 18:07:20 +08:00 |
|
hiyouga
|
386f590209
|
add template, modify datasets
|
2023-11-09 15:53:23 +08:00 |
|
lvzi
|
043c316ac8
|
fix tokenizer config changed after pretrain
Changing the tokenizer's attributes at the preprocessing stage results in saving a wrong tokenizer,
for example with baichuan2.
|
2023-11-08 15:50:46 +08:00 |
|
hiyouga
|
b2a60905f3
|
upgrade peft, fix #1088 #1411
|
2023-11-07 16:13:36 +08:00 |
|
hiyouga
|
a837172413
|
support sharegpt format, add datasets
|
2023-11-02 23:10:04 +08:00 |
|
hiyouga
|
3fe7df628d
|
support dataset cache
|
2023-10-26 21:48:45 +08:00 |
|
hiyouga
|
2caf91f824
|
remove filter in preprocess
|
2023-10-23 23:46:02 +08:00 |
|
hiyouga
|
af18b0dce7
|
fix #1184
|
2023-10-14 19:20:11 +08:00 |
|
hiyouga
|
be420e4179
|
fix aquila template, repair sft packing mechanism
|
2023-10-10 18:49:55 +08:00 |
|
hiyouga
|
de19614306
|
fix bug in packed sft dataset
|
2023-09-28 01:16:46 +08:00 |
|
hiyouga
|
d2ebd225db
|
tiny fix
|
2023-09-28 01:02:11 +08:00 |
|
hiyouga
|
b3fbba57eb
|
fix bug in pretraining
|
2023-09-28 00:45:20 +08:00 |
|
hiyouga
|
90375f600d
|
support LongLoRA
|
2023-09-27 21:55:50 +08:00 |
|
hiyouga
|
338b8664ed
|
fix #944
|
2023-09-21 19:51:02 +08:00 |
|
hiyouga
|
8632bff811
|
fix #896
|
2023-09-14 18:37:34 +08:00 |
|
hiyouga
|
a51b7c98ac
|
fix lora target
|
2023-09-09 17:04:45 +08:00 |
|
hiyouga
|
85b1f6632a
|
fix baichuan templates
|
2023-09-07 18:54:14 +08:00 |
|
hiyouga
|
370bdb6e43
|
fix #763
|
2023-09-01 23:13:05 +08:00 |
|
hiyouga
|
a9d1fb72f7
|
refactor dataset_attr, add eos in pt, fix #757
|
2023-09-01 19:00:45 +08:00 |
|
hiyouga
|
b0ed0dec5e
|
fix streaming in pt stage #548 #549
|
2023-08-17 17:59:26 +08:00 |
|
hiyouga
|
7407d9daa1
|
fix system prompt
|
2023-08-16 01:35:52 +08:00 |
|
hiyouga
|
d3844e97e3
|
add defaults
|
2023-08-11 13:56:26 +08:00 |
|
hiyouga
|
9c6dd10514
|
fix baichuan template
|
2023-08-11 13:45:47 +08:00 |
|
hiyouga
|
3ec4351cfd
|
support DPO training (2305.18290)
|
2023-08-11 03:02:53 +08:00 |
|
hiyouga
|
d86ea314a1
|
support val set in streaming mode
|
2023-08-09 23:00:26 +08:00 |
|
hiyouga
|
39cd8b6989
|
fix rm #420, fix template #426, fix #423
|
2023-08-09 16:23:31 +08:00 |
|
hiyouga
|
eecc4b2131
|
fix tokenizer #417
|
2023-08-08 23:59:41 +08:00 |
|
hiyouga
|
caa0eda27d
|
fix bug
|
2023-08-08 21:28:28 +08:00 |
|
hiyouga
|
a9980617f5
|
fix chatml template #408
|
2023-08-08 17:44:39 +08:00 |
|
hiyouga
|
20cf27976f
|
update readme
|
2023-08-07 15:02:02 +08:00 |
|
hiyouga
|
d87c8fd8ab
|
fix bos and eos token
|
2023-08-04 23:55:57 +08:00 |
|
hiyouga
|
8172ad1b5e
|
fix encode
|
2023-08-04 23:27:55 +08:00 |
|
hiyouga
|
b4852f9406
|
support chatml safe encoding
|
2023-08-04 23:14:28 +08:00 |
|
hiyouga
|
968ce0dcce
|
fix bug in preprocessing
|
2023-08-02 01:10:28 +08:00 |
|
hiyouga
|
e3f80774c4
|
fix #296
|
2023-08-01 18:43:53 +08:00 |
|
hiyouga
|
0411a4b3e1
|
support streaming data, fix #284 #274 #268
|
2023-07-31 23:33:00 +08:00 |
|
hiyouga
|
ed0e186a13
|
update web UI, support rm predict #210
|
2023-07-21 13:27:27 +08:00 |
|
hiyouga
|
67a2773074
|
simplify code
|
2023-07-20 15:08:57 +08:00 |
|
hiyouga
|
f751376613
|
modify code structure
|
2023-07-15 16:54:28 +08:00 |
|