NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition
Lu Ye
|
Ze Tao
|
Yong Huang
|
Yang Li
|
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand
Venue:
ACL |
Citations
URL
No Citations Yet
https://github.com/
https://github.com/lupantech/chameleon-llm/
https://github.com/qiancheng0/CREATOR/blob/
https://github.com/night-chen/ToolQA/blob/
https://docs.anthropic.com/claude/docs/
https://github.com/jujumilk3/
https://github.com/huggingface/
https://github.com/
https://platform
https://platform.openai.com/docs/guides/
https://cookbook.openai.com/examples/
https://github.com/mtrebi/
Field Of Study