Login / Signup
LT3 at SemEval-2021 Task 6: Using Multi-Modal Compact Bilinear Pooling to Combine Visual and Textual Understanding in Memes.
Pranaydeep Singh
Els Lefever
Published in:
SemEval@ACL/IJCNLP (2021)
Keyphrases
</>
multi modal
cross modal
visual representations
video search
single modality
high dimensional
visual information
multi modality
audio visual
visual representation
multimedia
image annotation
auto annotation
multiple modalities
visual cues
word sense disambiguation
information processing
graph cuts
low level
keywords