Login / Signup

SEAR: Semantically-grounded Audio Representations.

Rajat HebbarDigbalay BoseShrikanth Narayanan
Published in: ACM Multimedia (2023)
Keyphrases
  • multimedia
  • visual data
  • audio visual
  • genetic algorithm
  • similarity measure
  • data sources
  • visual information
  • data sets
  • image processing
  • hidden markov models
  • higher level
  • visual features
  • audio recordings