Login / Signup
Iterative Answer Prediction With Pointer-Augmented Multimodal Transformers for TextVQA.
Ronghang Hu
Amanpreet Singh
Trevor Darrell
Marcus Rohrbach
Published in:
CVPR (2020)
Keyphrases
</>
prediction accuracy
prediction algorithm
prediction error
prediction model
multi modal
data driven
multimodal interaction
information retrieval
motion estimation
image registration
data sets
long term
evolutionary algorithm
case study
computer vision
audio visual
correct answers
iterative methods
real world