DIVE into MoE: Diversity-Enhanced Reconstruction of Large Language Models from Dense into Mixture-of-Experts

Yuchen Feng | Bowen Shen | Naibin Gu | Jiaxuan Zhao | Peng Fu | Zheng Lin | Weiping Wang

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL
