Accurate KV Cache Quantization with Outlier Tokens Tracing
Yi Su | Yuechi Zhou | Quantong Qiu | Juntao Li | Qingrong Xia | Ping Li | Xinyu Duan | Zhefeng Wang | Min Zhang
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL
Citations: No Citations Yet
URLs:
https://github.com/yisunlp/OTT
https://huggingface.co/meta-llama/Llama-2-7b-chat-hf
https://github.com/declare-lab/instruct-eval
Field Of Study