MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts

Wei Tao | Haocheng Lu | Xiaoyang Qu | Bin Zhang | Kai Lu | Jiguang Wan | Jianzong Wang |

Paper Details:

Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL |

Citations

URL

No Citations Yet

No URLs Found

Field Of Study