Login / Signup
Multi-head or Single-head? An Empirical Comparison for Transformer Training.
Liyuan Liu
Jialu Liu
Jiawei Han
Published in:
CoRR (2021)
Keyphrases
</>
human head
real time
fuzzy logic
training samples
physical parameters
search engine
website
artificial neural networks
supervised learning
online learning
training algorithm
head pose estimation
gaze direction