Looking Similar, Sounding Different: Leveraging Counterfactual Cross-Modal Pairs for Audiovisual Representation Learning.

Nikhil Singh, Chih-Wei Wu, Iroro Orife, Mahdi M. Kalayeh
Published in: CoRR (2023)
Keyphrases
  • cross modal
  • multi modal
  • visual recognition
  • learning algorithm
  • perceptual information
  • learning tasks
  • statistical learning
  • supervised learning
  • multimedia retrieval