ldwang d1a35c3fb1 Add patch_mixtral_replace_moe_impl for full training of Mixtral using DeepSpeed ZeRO-3.
Signed-off-by: ldwang <ftgreat@gmail.com>

Former-commit-id: 5f50c02f0e425737cd80abdf8fde9e25abf13083
2024-01-24 15:25:31 +08:00