Sign in

Clotho: an Audio Captioning Dataset.

Konstantinos DrossosSamuel LippingTuomas Virtanen
Published in: ICASSP (2020)
Keyphrases
  • multimedia
  • information retrieval
  • signal processing
  • learning algorithm
  • object recognition
  • benchmark datasets
  • visual information
  • training dataset