A Survey on Model Compression for Large Language Models

Xunyu Zhu | Jian Li | Yong Liu | Can Ma | Weiping Wang |

Paper Details:


Year: 2024
Location: Cambridge, MA
Venue: TACL |