NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models
Yifei Liu
|
Jicheng Wen
|
Yang Wang
|
Shengyu Ye
|
Li Lyna Zhang
|
Ting Cao
|
Cheng Li
|
Mao Yang
|
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, USA
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/microsoft/VPTQ
Field Of Study