hiyouga 0b0e27c2f1 fix #4609
unwrap_model_for_generation(reward_model) is necessary for zero3 training


Former-commit-id: c8d5b21700577cae8d6ca03359bcf1762c8b7cb8
2024-07-03 19:45:51 +08:00
..
2024-06-22 00:00:38 +08:00
2024-07-02 22:23:08 +08:00
2024-06-16 01:06:41 +08:00
2024-07-02 22:23:08 +08:00
2024-07-01 01:19:27 +08:00
2024-07-03 02:31:50 +08:00
2024-07-03 19:45:51 +08:00
2024-06-28 06:00:26 +08:00
2024-06-15 17:54:33 +08:00
2024-06-27 20:14:48 +08:00
2024-06-15 17:54:33 +08:00