Towards Economical Inference: Enabling DeepSeek’s Multi-Head Latent Attention in Any Transformer-based LLMs
Authors: Tao Ji | Bin Guo | Yuanbin Wu | Qipeng Guo | Shenlixing Shenlixing | Chenzhan Chenzhan | Xipeng Qiu | Qi Zhang | Tao Gui
Paper Details:
Month: July
Year: 2025
Location: Vienna, Austria
Venue: ACL
Citations: No Citations Yet
URL:
https://github
https://huggingface.co/collections/
https://huggingface.co/meta-llama/Llama-2-7b
https://huggingface.co/blog/
https://huggingface.co/blog/smollm
https://huggingface.co/datasets/
https://huggingface.co/datasets/bigcode/