Faster In-Context Learning for LLMs via N-Gram Trie Speculative Decoding

Jinglin Chen | Qiwei Li | Zuchao Li | Baoyuan Qi | Liu Guoming | Haojun Ai | Hai Zhao | Ping Wang |

Paper Details:

Month: November
Year: 2025
Location: Suzhou, China
Venue: EMNLP |

Citations

URL

No Citations Yet