3 Commits

Author | SHA1 | Message | Date
hiyouga | 15cef791ba | fix #1356 (Former-commit-id: dff128c7e38dd079a5840ea4e73ee3e9bbd1c3c9) | 2023-11-02 16:51:52 +08:00
hiyouga | abdfa26d06 | support DPO training (2305.18290) (Former-commit-id: 3ec4351cfdaf2aefcc7d13345e19d79874ed61d3) | 2023-08-11 03:02:53 +08:00
hiyouga | a696148d6b | modity code structure (Former-commit-id: f75137661358f9070bc70c341dfa2cc5fd69cf94) | 2023-07-15 16:54:28 +08:00