Sign in

Seeing Voices and Hearing Voices: Learning Discriminative Embeddings Using Cross-Modal Self-Supervision.

Soo-Whan ChungHong-Goo KangJoon Son Chung
Published in: INTERSPEECH (2020)
Keyphrases
  • cross modal
  • multi modal
  • learning algorithm
  • active learning
  • visual recognition
  • learning tasks
  • supervised learning
  • dimensionality reduction
  • perceptual information