Commit Graph

5 Commits

Author SHA1 Message Date
hiyouga
e0aadd4b34 fix ppo dataset bug #4012
Former-commit-id: 149610c636
2024-06-06 19:03:20 +08:00
hiyouga
e898d8bbc4 update trainers
Former-commit-id: fad2591e31
2024-06-06 18:45:49 +08:00
hiyouga
468d0e7ed1 10x generate in ppo w/ zero3
https://github.com/huggingface/trl/pull/1483

Former-commit-id: 65cd8bdbdb
2024-05-29 00:23:23 +08:00
hiyouga
13d7b48efe improve KTO impl., replace datasets
Former-commit-id: c450ee87a3
2024-05-18 03:44:56 +08:00
hiyouga
cae823ddf0 rename package
Former-commit-id: 308edbc426
2024-05-16 18:39:08 +08:00