HookMoE: A learnable performance compensation strategy of Mixture-of-Experts for LLM inference acceleration

Cheng Longkai | Along He | Mulin Li | Xie Xueshuo | Tao Li |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |