34 Commits

Author SHA1 Message Date
hiyouga
e0aadd4b34 fix ppo dataset bug #4012
Former-commit-id: 149610c636bbb974e546d13fa302884ea65a6d38
2024-06-06 19:03:20 +08:00
hiyouga
94c37490d1 support glm-4
Former-commit-id: f48f5e646e2da9e02333d027033141b0e75dfcf8
2024-06-05 15:16:38 +08:00
hiyouga
0eff6a66d5 tiny fix
Former-commit-id: 5a13b3baa63225e7f79e024610722de0f87e0acc
2024-06-04 00:31:10 +08:00
hiyouga
8ecf606230 fix #3992
Former-commit-id: a18acf2abe28e37233bf8c8ed2600618ea3b62e9
2024-06-04 00:17:36 +08:00
hiyouga
64d24842fe fix data loader hint
Former-commit-id: 49b1e88e3da3be0fb78f53e5f924a9be67568a02
2024-06-03 18:28:27 +08:00
hoshi-hiyouga
9b6bdf9449 Merge pull request #3829 from seanzhang-zhichen/add_dataset_sample_num
Add dataset sample num

Former-commit-id: 483eb47e5d670e23fb713b942f6890b8259f4363
2024-05-30 00:25:45 +08:00
hoshi-hiyouga
7b83c550ab Update loader.py
Former-commit-id: ca5dd7c6c115a359e4b50e93f4ffcc9f2955ec2f
2024-05-30 00:20:20 +08:00
hoshi-hiyouga
9fc713da89 Update loader.py
Former-commit-id: f9a88b89ca8b8f9a0c5def03b154f9d67f558edf
2024-05-30 00:17:21 +08:00
hoshi-hiyouga
c0f11a280e Update loader.py
Former-commit-id: b55fb611c57be03fb38218c7da1d96f6848496ba
2024-05-30 00:12:12 +08:00
hoshi-hiyouga
69a51cacb1 Update parser.py
Former-commit-id: 51dd454337941801d0a66eaadb0da2e007e9573d
2024-05-30 00:05:20 +08:00
hiyouga
19a3262387 fix cohere system
Former-commit-id: d0aa36b8ad02287d97930101958456c523e699d3
2024-05-29 20:58:23 +08:00
hiyouga
c05cb3769f fix #3965
Former-commit-id: 0930f5869929634baa0881167d3d6c714afc63d9
2024-05-29 20:55:51 +08:00
hiyouga
a71a6a05c3 update readme
Former-commit-id: 89ca832740731dfb121175aa5c16b13bd4944011
2024-05-29 18:39:11 +08:00
hzhaoy
ce1be3da4b add TeleChat-12B/TeleChat-12B-v2 models
Former-commit-id: 0dd632fe9e5bbf08605d4b9c6887208b7a127317
2024-05-29 15:00:37 +08:00
Yimi81
7324984127 fix yi template
Former-commit-id: dc07413e7d0b138c89eacaef17596e83ef226540
2024-05-27 13:11:25 +00:00
hiyouga
0706dbf7e6 tiny fix
Former-commit-id: c1fdf81df6ade5da7be4eb66b715f0efd171d5aa
2024-05-27 20:54:26 +08:00
hoshi-hiyouga
eceec1d7fd Update template.py
Former-commit-id: f1002b9f930758bb27794ab88a2adbe24417b076
2024-05-27 20:51:56 +08:00
hoshi-hiyouga
b7b8223230 Update template.py
Former-commit-id: 122213a7a7e114b0c390158cac0ae9faeceb2efc
2024-05-27 20:51:26 +08:00
Jianbai Ye
d2c1df7f3d add openchat-3.6-8B support
Former-commit-id: cff815391fd15f30647e8694e08c47a514fd6eb2
2024-05-27 20:42:08 +08:00
hiyouga
df33548b39 update readme
Former-commit-id: 5581cb2e4e59f3f8109e2acd4611789f9e50bfca
2024-05-27 18:14:02 +08:00
seanzhang-zhichen
9c8d79fbe3 Merge branch 'main' into add_dataset_sample_num
Former-commit-id: 27cb51f7f86f97ae231abfdcb0114ff245d7af9c
2024-05-24 15:57:47 +08:00
hiyouga
3e729798df refactor data preprocessing, fix mllm rlhf
Former-commit-id: 3a023bca2a502810a436cfba7708df164754ea62
2024-05-24 04:08:25 +08:00
hiyouga
d3490aceb7 fix paligemma sft
requires transformers>=4.41.1


Former-commit-id: de0e67aff13f191fd899ad717ec349a6bdb14f2a
2024-05-24 00:23:40 +08:00
hiyouga
4ddc1c9c16 fix paligemma sft
Former-commit-id: 7134fb02bbdc9421f6c314ae176d5786a8cd768d
2024-05-21 20:03:09 +08:00
hiyouga
a935c5105d fix paligemma data preprocess
Former-commit-id: e55c85ac72f4938738dbce576f83b47a1fea88ae
2024-05-20 23:51:32 +08:00
hiyouga
446c681b58 fix paligemma inference
Former-commit-id: 542229abb3aba2032d4c52a878c0fd35ba299691
2024-05-20 23:36:43 +08:00
zhangzc
4b90f04c1f fix conflict
Former-commit-id: d956041640d9abc5e59919a227d27270fb513a7e
2024-05-20 17:10:01 +08:00
hiyouga
864da49139 fix chat engines
do not use pop(key, default) since api assigns None to dict values


Former-commit-id: d52fae2fa866afeb6156dc98388ce5cc6d5eca77
2024-05-20 00:36:43 +08:00
hiyouga
0e57bb201c fix jinja template
Former-commit-id: 10573e1639e7a71813927a8bfff3b036c21064c3
2024-05-19 23:38:30 +08:00
hiyouga
519d2511ae improve data process logger
Former-commit-id: a851056229f37391023627180b5712ed64ae3528
2024-05-18 22:02:42 +08:00
hiyouga
1e867c0fa0 fix #3803
Former-commit-id: 0edc16769f7e84b74e5fc6a1382e284632567c4c
2024-05-18 16:13:14 +08:00
hiyouga
13d7b48efe improve KTO impl., replace datasets
Former-commit-id: c450ee87a35ff9235f9b695b0de2e042b2971178
2024-05-18 03:44:56 +08:00
enji.zhou
03956053b8 add kto
Former-commit-id: db1d5a4f51faae61fe18666057353747b01f5b8d
2024-05-17 13:09:17 +08:00
hiyouga
cae823ddf0 rename package
Former-commit-id: 308edbc4260d45907b4a9d3a45ec21d83e48aacb
2024-05-16 18:39:08 +08:00