GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference

Chao Zeng | Songwei Liu | Shu Yang | Fangmin Chen | Xing Mei | Lean Fu |

Paper Details:

Month: December
Year: 2025
Location: Mumbai, India
Venue: IJCNLP | AACL |

Citations

URL

No Citations Yet

No URLs Found

Field Of Study