Speak Like a Professional: Increasing Speech Intelligibility by Mimicking Professional Announcer Voice with Voice Conversion.
Tuan Vu Ho, Maori Kobayashi, Masato Akagi
Published in: CoRR (2022)
Keyphrases
- text to speech
- emotion recognition
- speech recognition
- voice activity detection
- fundamental frequency
- speech quality
- pattern recognition
- biologically inspired
- speech synthesis
- speech sounds
- digital photography
- speech recognition errors
- practical experience
- text to speech synthesis
- synthesized speech
- prosodic features
- database
- multi modal
- databases
- real time