WESPER: Zero-shot and Realtime Whisper to Normal Voice Conversion for Whisper-based Speech Interactions.
Jun Rekimoto
Published in: CoRR (2023)
Keyphrases
- text to speech
- real time
- speech recognition
- emotion recognition
- speech quality
- speech recognition errors
- speech synthesis
- speech signal
- voice activity detection
- speech sounds
- fundamental frequency
- automatic speech recognition
- audio visual
- synthesized speech
- neural network
- real time systems
- human computer interaction
- genetic algorithm