UPC Multimodal Speaker Diarization System for the 2018 Albayzin Challenge.

Miquel Angel India Massana, Itziar Sagastiberri, Ponç Palau, Elisa Sayrol, Josep Ramon Morros, Javier Hernando
Published in: IberSPEECH (2018)
Keyphrases
  • speaker diarization
  • multi modal
  • audio stream
  • speech recognition
  • low level
  • neural network
  • computer vision
  • image processing
  • feature extraction
  • artificial neural networks
  • generative model
  • visual features