Layer-wise Pruning of Transformer Attention Heads for Efficient Language Modeling.

Kyuhong Shim, Iksoo Choi, Wonyong Sung, Jungwook Choi
Published in: ISOCC (2021)