Login / Signup

Speaker Diarization of Scripted Audiovisual Content.

Yogesh VirkarBrian ThompsonRohit PaturiSundararajan SrinivasanMarcello Federico
Published in: CoRR (2023)
Keyphrases
  • speaker diarization
  • multimedia content
  • multimedia
  • metadata
  • multi modal
  • multimedia data
  • neural network
  • information retrieval
  • decision trees
  • maximum likelihood
  • context aware
  • visual information
  • audio stream