NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
Joshua Ainslie
|
James Lee-Thorp
|
Michiel de Jong
|
Yury Zemlyanskiy
|
Federico Lebron
|
Sumit Sanghai
|
Paper Details:
Month: December
Year: 2023
Location: Singapore
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/google/flaxformer
https://cloud.google.com/tpu/docs/
https://github.com/google/flaxformer/
Field Of Study