4 Commits

| Author | SHA1 | Message | Date |
|---|---|---|---|
| jiaqiw09 | 28a6ea1cdc | [v1] add deepspeed zero3 trigger for low memory usage weight loading (#10300) | 2026-04-21 14:09:52 +08:00 |
| jiaqiw09 | df2e6edb7e | [v1] add init on rank0 for fsdp2 (#10264) | 2026-03-27 14:54:03 +08:00 |
| Yaowei Zheng | 122cd46084 | [model] update constants (#10220) | 2026-02-26 21:13:56 +08:00 |
| 浮梦 | 5c52afa30d | [v1] support deepspeed (#10181) | 2026-02-12 17:24:30 +08:00 |