Login / Signup

Harnessing the power of Wav2Vec2 and CNNs for Robust Speaker Identification on the VoxCeleb and LibriSpeech Datasets.

Or Haim AnidjarRevital MarbelRoi Yozevitch
Published in: Expert Syst. Appl. (2024)
Keyphrases
  • speaker identification
  • noisy environments
  • speech recognition
  • gaussian mixture model
  • speech signal
  • feature transformation
  • pattern recognition
  • k means
  • video data