VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models

Yifei Liu | Jicheng Wen | Yang Wang | Shengyu Ye | Li Lyna Zhang | Ting Cao | Cheng Li | Mao Yang

Paper Details:

Month: November
Year: 2024
Location: Miami, Florida, USA
Venue: EMNLP
