Commit Graph

6 Commits

Author SHA1 Message Date
hiyouga
4489d73ac7 fix ppo trainer save zero3 model
accelerator.get_state_dict(ds_model) should be called at all ranks
2024-06-07 05:14:19 +08:00
hiyouga
2702d7e952 fix ppo in trl 0.8.6 2024-06-07 04:48:29 +08:00
hiyouga
74f96efef9 rename files 2024-06-07 00:09:06 +08:00
hiyouga
76c61905b2 fix ppo+zero3 #3108 2024-06-06 23:30:07 +08:00
hiyouga
65cd8bdbdb 10x generate in ppo w/ zero3
https://github.com/huggingface/trl/pull/1483
2024-05-29 00:23:23 +08:00
hiyouga
308edbc426 rename package 2024-05-16 18:39:08 +08:00