hiyouga
|
89ca832740
|
update readme
|
2024-05-29 18:39:11 +08:00 |
|
hzhaoy
|
0dd632fe9e
|
add TeleChat-12B/TeleChat-12B-v2 models
|
2024-05-29 15:00:37 +08:00 |
|
Yimi81
|
dc07413e7d
|
fix yi template
|
2024-05-27 13:11:25 +00:00 |
|
hiyouga
|
c1fdf81df6
|
tiny fix
|
2024-05-27 20:54:26 +08:00 |
|
hoshi-hiyouga
|
f1002b9f93
|
Update template.py
|
2024-05-27 20:51:56 +08:00 |
|
hoshi-hiyouga
|
122213a7a7
|
Update template.py
|
2024-05-27 20:51:26 +08:00 |
|
Jianbai Ye
|
cff815391f
|
add openchat-3.6-8B support
|
2024-05-27 20:42:08 +08:00 |
|
hiyouga
|
5581cb2e4e
|
update readme
|
2024-05-27 18:14:02 +08:00 |
|
seanzhang-zhichen
|
27cb51f7f8
|
Merge branch 'main' into add_dataset_sample_num
|
2024-05-24 15:57:47 +08:00 |
|
hiyouga
|
3a023bca2a
|
refactor data preprocessing, fix mllm rlhf
|
2024-05-24 04:08:25 +08:00 |
|
hiyouga
|
de0e67aff1
|
fix paligemma sft
requires transformers>=4.41.1
|
2024-05-24 00:23:40 +08:00 |
|
hiyouga
|
7134fb02bb
|
fix paligemma sft
|
2024-05-21 20:03:09 +08:00 |
|
hiyouga
|
e55c85ac72
|
fix paligemma data preprocess
|
2024-05-20 23:51:32 +08:00 |
|
hiyouga
|
542229abb3
|
fix paligemma inference
|
2024-05-20 23:36:43 +08:00 |
|
zhangzc
|
d956041640
|
fix conflict
|
2024-05-20 17:10:01 +08:00 |
|
hiyouga
|
d52fae2fa8
|
fix chat engines
do not use pop(key, default) since api assigns None to dict values
|
2024-05-20 00:36:43 +08:00 |
|
hiyouga
|
10573e1639
|
fix jinja template
|
2024-05-19 23:38:30 +08:00 |
|
hiyouga
|
a851056229
|
improve data process logger
|
2024-05-18 22:02:42 +08:00 |
|
hiyouga
|
0edc16769f
|
fix #3803
|
2024-05-18 16:13:14 +08:00 |
|
hiyouga
|
c450ee87a3
|
improve KTO impl., replace datasets
|
2024-05-18 03:44:56 +08:00 |
|
enji.zhou
|
db1d5a4f51
|
add kto
|
2024-05-17 13:09:17 +08:00 |
|
hiyouga
|
308edbc426
|
rename package
|
2024-05-16 18:39:08 +08:00 |
|