SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation.
Arya D. McCarthyLiezl PuzonJuan Miguel PinoPublished in: ICASSP (2020)
Keyphrases
- speech recognition
- speaker recognition
- automatic speech recognition
- audio visual
- speaker verification
- speaker identification
- speech signal
- hidden markov models
- prosodic features
- noisy environments
- semi automatic
- emotion recognition
- fully automatic
- machine translation
- speech recognizer
- speaker diarization
- recognition engine
- natural language
- automatic transcription