Login / Signup
MMIU: Dataset for Visual Intent Understanding in Multimodal Assistants.
Alkesh Patel
Joel Ruben Antony Moniz
Roman Nguyen
Nick Tzou
Hadas Kotek
Vincent Renkens
Published in:
CoRR (2021)
Keyphrases
</>
multi modal
cross modal
visual representation
multimodal information
benchmark datasets
visual information
visual features
data sets
visual analysis
visual perception
visual cues
audio visual
image sequences
information retrieval
visual representations
multimodal interaction
database