Login / Signup
Crowdsourcing a Dataset of Audio Captions.
Samuel Lipping
Konstantinos Drossos
Tuomas Virtanen
Published in:
CoRR (2019)
Keyphrases
</>
multimedia
visual features
benchmark datasets
database
signal processing
visual information
audio video
audio visual