4 Commits

Author SHA1 Message Date
hiyouga
4807c11db8 support SimPO #3900
Former-commit-id: cb63b32986c43f97994211ec34dc5928fc3bb9d7
2024-05-26 23:46:33 +08:00
hiyouga
3e729798df refactor data preprocessing, fix mllm rlhf
Former-commit-id: 3a023bca2a502810a436cfba7708df164754ea62
2024-05-24 04:08:25 +08:00
hiyouga
13d7b48efe improve KTO impl., replace datasets
Former-commit-id: c450ee87a35ff9235f9b695b0de2e042b2971178
2024-05-18 03:44:56 +08:00
hiyouga
cae823ddf0 rename package
Former-commit-id: 308edbc4260d45907b4a9d3a45ec21d83e48aacb
2024-05-16 18:39:08 +08:00