Disentangling Voice and Content with Self-Supervision for Speaker Recognition.
Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li
Published in: NeurIPS (2023)
Keyphrases
- speaker recognition
- Gaussian mixture model
- mel-frequency cepstral coefficients
- vector quantization
- speaker verification
- probabilistic neural network
- speaker identification
- multimedia content
- multiscale
- multimedia
- emotion recognition
- speech recognition
- partial least squares
- noise reduction
- pattern classification
- artificial neural networks
- information retrieval