Login / Signup
Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation.
Federico Landi
Lorenzo Baraldi
Marcella Cornia
Massimiliano Corsini
Rita Cucchiara
Published in:
CoRR (2019)
Keyphrases
</>
multi modal
computer vision
vision system
multi modality
audio visual
image annotation
cross modal
image processing
video search
machine learning
markov random field
semantic concepts
multiple modalities
single modality