Unsupervised active speaker detection in media content using cross-modal information.

Rahul Sharma, Shrikanth Narayanan
Published in: CoRR (2022)
Keyphrases
  • cross modal
  • multi modal
  • multimedia data
  • multimedia databases
  • databases
  • information retrieval
  • keywords
  • information extraction
  • language model