Time-Domain Speech Super-Resolution With GAN Based Modeling for Telephony Speaker Verification.
Saurabh KatariaJesús VillalbaLaureano Moro-VelázquezPiotr ZelaskoNajim DehakPublished in: IEEE ACM Trans. Audio Speech Lang. Process. (2024)
Keyphrases
- super resolution
- speaker verification
- speaker recognition
- low resolution
- noisy environments
- high resolution
- audio visual
- prosodic features
- low resolution images
- motion estimation
- speech recognition
- super resolution reconstruction
- high quality
- emotion recognition
- multilayer perceptron
- depth map
- speech signal
- speaker identification
- image patches
- noise reduction
- language model
- image data