Sign in

Multi-scale Speaker Diarization with Dynamic Scale Weighting.

Taejin ParkNithin Rao KoluguriJagadeesh BalamBoris Ginsburg
Published in: CoRR (2022)
Keyphrases
  • multiscale
  • speaker diarization
  • scale space
  • pattern recognition
  • machine learning
  • learning algorithm
  • image representation
  • information retrieval
  • feature extraction
  • multi modal
  • audio stream