Amphista: Bi-directional Multi-head Decoding for Accelerating LLM Inference

Zeping Li | Xinlong Yang | Ziheng Gao | Ji Liu | Guanchen Li | Zhuang Liu | Dong Li | Jinzhang Peng | Lu Tian | Emad Barsoum

Paper Details:

Month: April
Year: 2025
Location: Albuquerque, New Mexico
Venue: NAACL
