Sign in

Unsupervised cross-modal audio representation learning from unstructured multilingual text.

Alexander SchindlerSergiu GordeaPeter Knees
Published in: SAC (2020)
Keyphrases
  • cross modal
  • perceptual information
  • multi modal
  • visual recognition
  • supervised learning
  • learning algorithm
  • multimedia
  • keywords
  • digital libraries
  • e learning
  • image retrieval
  • text mining