Login / Signup
Speaker Diarization of Scripted Audiovisual Content.
Yogesh Virkar
Brian Thompson
Rohit Paturi
Sundararajan Srinivasan
Marcello Federico
Published in:
CoRR (2023)
Keyphrases
</>
speaker diarization
multimedia content
multimedia
metadata
multi modal
multimedia data
neural network
information retrieval
decision trees
maximum likelihood
context aware
visual information
audio stream