Login / Signup

Noise-Tolerant Self-Supervised Learning for Audio-Visual Voice Activity Detection.

Ui-Hyun Kim
Published in: Interspeech (2021)
Keyphrases
  • noise tolerant
  • audio visual
  • noisy data
  • learning tasks
  • data sets
  • computer vision
  • supervised learning
  • multi modal
  • e learning
  • metadata
  • multimedia
  • multiscale
  • feature vectors
  • hidden markov models
  • image data