Login / Signup
A Novel Attention-based Aggregation Function to Combine Vision and Language.
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
Published in:
CoRR (2020)
Keyphrases
</>
programming language
computer vision
language learning
real time
vision system
visual attention
specification language
language processing
database
social networks
image processing
image segmentation
natural language
aggregation functions