SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation
Aurick Qiao | Zhewei Yao | Samyam Rajbhandari | Yuxiong He
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP
Citations: No Citations Yet
URL:
https://github.com/sno
https://gi
https://github.com/snowf
https://github.com/snowflakedb/arcti
https://huggingface.co/gretelai/synthetic-
https://huggingface.co/neuralmagic/Meta-Llama-3.1-8B-Instruc
https://github.com/neuralmagic/lm-evaluation-harness