• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Speaker Diarization of Scripted Audiovisual Content.

Yogesh VirkarBrian ThompsonRohit PaturiSundararajan SrinivasanMarcello Federico
Published in: CoRR (2023)
Keyphrases
  • speaker diarization
  • multimedia content
  • multimedia
  • metadata
  • multi modal
  • multimedia data
  • neural network
  • information retrieval
  • decision trees
  • maximum likelihood
  • context aware
  • visual information
  • audio stream