NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers
Longwei Zou
|
Qingyang Wang
|
Han Zhao
|
Jiangangkong Jiangangkong
|
Yi Yang
|
Yangdong Deng
|
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand
Venue:
ACL |
Citations
URL
No Citations Yet
https://github.com/Photooon/CQIL
https://github.com/kingoflolz/
Field Of Study