hiyouga
|
ccc8b64cc2
|
update data processors
|
2024-06-07 04:15:40 +08:00 |
|
hoshi-hiyouga
|
181dbb0d05
|
Merge pull request #4009 from AlongWY/main
supervised packing with greedy knapsack algorithm
|
2024-06-07 03:48:46 +08:00 |
|
hoshi-hiyouga
|
c09ad8bab3
|
Update supervised.py
|
2024-06-07 03:42:08 +08:00 |
|
hoshi-hiyouga
|
788e8232fc
|
Update supervised.py
|
2024-06-07 03:38:23 +08:00 |
|
hoshi-hiyouga
|
8cecade708
|
Update supervised.py
|
2024-06-07 03:38:04 +08:00 |
|
hiyouga
|
74f96efef9
|
rename files
|
2024-06-07 00:09:06 +08:00 |
|
hiyouga
|
149610c636
|
fix ppo dataset bug #4012
|
2024-06-06 19:03:20 +08:00 |
|
hiyouga
|
f48f5e646e
|
support glm-4
|
2024-06-05 15:16:38 +08:00 |
|
hiyouga
|
5a13b3baa6
|
tiny fix
|
2024-06-04 00:31:10 +08:00 |
|
hiyouga
|
a18acf2abe
|
fix #3992
|
2024-06-04 00:17:36 +08:00 |
|
hiyouga
|
49b1e88e3d
|
fix data loader hint
|
2024-06-03 18:28:27 +08:00 |
|
ylfeng
|
b47e317447
|
remove empty line
|
2024-05-31 21:43:08 +08:00 |
|
ylfeng
|
84aee57901
|
fix eos
|
2024-05-31 21:40:41 +08:00 |
|
ylfeng
|
f9db439cb7
|
supervised packing with greedy knapsack algorithm
|
2024-05-31 15:33:54 +08:00 |
|
hoshi-hiyouga
|
483eb47e5d
|
Merge pull request #3829 from seanzhang-zhichen/add_dataset_sample_num
Add dataset sample num
|
2024-05-30 00:25:45 +08:00 |
|
hoshi-hiyouga
|
ca5dd7c6c1
|
Update loader.py
|
2024-05-30 00:20:20 +08:00 |
|
hoshi-hiyouga
|
f9a88b89ca
|
Update loader.py
|
2024-05-30 00:17:21 +08:00 |
|
hoshi-hiyouga
|
b55fb611c5
|
Update loader.py
|
2024-05-30 00:12:12 +08:00 |
|
hoshi-hiyouga
|
51dd454337
|
Update parser.py
|
2024-05-30 00:05:20 +08:00 |
|
hiyouga
|
d0aa36b8ad
|
fix cohere system
|
2024-05-29 20:58:23 +08:00 |
|
hiyouga
|
0930f58699
|
fix #3965
|
2024-05-29 20:55:51 +08:00 |
|
hiyouga
|
89ca832740
|
update readme
|
2024-05-29 18:39:11 +08:00 |
|
hzhaoy
|
0dd632fe9e
|
add TeleChat-12B/TeleChat-12B-v2 models
|
2024-05-29 15:00:37 +08:00 |
|
Yimi81
|
dc07413e7d
|
fix yi template
|
2024-05-27 13:11:25 +00:00 |
|
hiyouga
|
c1fdf81df6
|
tiny fix
|
2024-05-27 20:54:26 +08:00 |
|
hoshi-hiyouga
|
f1002b9f93
|
Update template.py
|
2024-05-27 20:51:56 +08:00 |
|
hoshi-hiyouga
|
122213a7a7
|
Update template.py
|
2024-05-27 20:51:26 +08:00 |
|
Jianbai Ye
|
cff815391f
|
add openchat-3.6-8B support
|
2024-05-27 20:42:08 +08:00 |
|
hiyouga
|
5581cb2e4e
|
update readme
|
2024-05-27 18:14:02 +08:00 |
|
seanzhang-zhichen
|
27cb51f7f8
|
Merge branch 'main' into add_dataset_sample_num
|
2024-05-24 15:57:47 +08:00 |
|
hiyouga
|
3a023bca2a
|
refactor data preprocessing, fix mllm rlhf
|
2024-05-24 04:08:25 +08:00 |
|
hiyouga
|
de0e67aff1
|
fix paligemma sft
requires transformers>=4.41.1
|
2024-05-24 00:23:40 +08:00 |
|
hiyouga
|
7134fb02bb
|
fix paligemma sft
|
2024-05-21 20:03:09 +08:00 |
|
hiyouga
|
e55c85ac72
|
fix paligemma data preprocess
|
2024-05-20 23:51:32 +08:00 |
|
hiyouga
|
542229abb3
|
fix paligemma inference
|
2024-05-20 23:36:43 +08:00 |
|
zhangzc
|
d956041640
|
fix conflict
|
2024-05-20 17:10:01 +08:00 |
|
hiyouga
|
d52fae2fa8
|
fix chat engines
do not use pop(key, default) since api assigns None to dict values
|
2024-05-20 00:36:43 +08:00 |
|
hiyouga
|
10573e1639
|
fix jinja template
|
2024-05-19 23:38:30 +08:00 |
|
hiyouga
|
a851056229
|
improve data process logger
|
2024-05-18 22:02:42 +08:00 |
|
hiyouga
|
0edc16769f
|
fix #3803
|
2024-05-18 16:13:14 +08:00 |
|
hiyouga
|
c450ee87a3
|
improve KTO impl., replace datasets
|
2024-05-18 03:44:56 +08:00 |
|
enji.zhou
|
db1d5a4f51
|
add kto
|
2024-05-17 13:09:17 +08:00 |
|
hiyouga
|
308edbc426
|
rename package
|
2024-05-16 18:39:08 +08:00 |
|