FoldMoE: Efficient Long Sequence MoE Training via Attention-MoE Pipelining

Guichao Zhu | Lintian Lei | Yuhao Qing | Yichao Fu | Fanxin Li | Dong Huang | Zekai Sun | Heming Cui

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL
