Login / Signup
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints.
Joshua Ainslie
James Lee-Thorp
Michiel de Jong
Yury Zemlyanskiy
Federico Lebrón
Sumit Sanghai
Published in:
EMNLP (2023)
Keyphrases
</>
database
query processing
response time
query expansion
range queries
structured prediction
databases
control system
probabilistic model
relevance feedback
model selection
similarity search
query formulation