Squeezed Attention: Accelerating Long Context Length LLM Inference
Coleman Richard Charles Hooper | Sehoon Kim | Hiva Mohammadzadeh | Monishwaran Maheswaran | Sebastian Zhao | June Paik | Michael W. Mahoney | Kurt Keutzer | Amir Gholami
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL
Citations: No Citations Yet
URLs:
https://github.com/
https://info.arxiv.org/
https://www.anthropic
https://crfm.stanford.edu/2023/10/12/
https://blog
https://github.com/triton-
https://arxiv
Field Of Study