NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Speculative Streaming: Efficient and Scalable Speculative Decoding with Multi-Stream Attention
Nikhil Bhendawade
|
Irina Belousova
|
Qichen Fu
|
Henry Mason
|
Antonie Lin
|
Mohammad Rastegari
|
Mahyar Najibi
|
Paper Details:
Month: November
Year: 2025
Location: Suzhou, China
Venue:
EMNLP |
Citations
URL
No Citations Yet
No URLs Found
Field Of Study