Audiovisual Transformer Architectures for Large-Scale Classification and Synchronization of Weakly Labeled Audio Events.
Wim Boes
Hugo Van hamme
Published in:
ACM Multimedia (2019)
Keyphrases
weakly labeled
feature vectors
support vector
visual information
audio visual
multiscale
emotion recognition
machine learning
feature selection
multimedia
pairwise
supervised learning
feature set
music score
image classification
training samples