HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts

Truong Giang Do | Le Khiem | Quang Pham | TrungTin Nguyen | Thanh-Nam Doan | Binh Nguyen | Chenghao Liu | Savitha Ramasamy | Xiaoli Li | Steven Hoi |

Paper Details:

Month: December
Year: 2023
Location: Singapore
Venue: EMNLP |

Citations

URL

No Citations Yet