Login / Signup
Iterative Answer Prediction with Pointer-Augmented Multimodal Transformers for TextVQA.
Ronghang Hu
Amanpreet Singh
Trevor Darrell
Marcus Rohrbach
Published in:
CoRR (2019)
Keyphrases
</>
prediction accuracy
prediction algorithm
prediction model
prediction error
data structure
long term
multi modal
image sequences
neural network
real world