Enhancing Speech-to-Speech Translation with Multiple TTS Targets.
Jiatong ShiYun TangAnn LeeHirofumi InagumaChanghan WangJuan PinoShinji WatanabePublished in: CoRR (2023)
Keyphrases
- text to speech
- speech recognition
- speech synthesis
- prosodic features
- spoken language
- multiple targets
- audio visual
- recognition engine
- language acquisition
- speech processing
- automatic speech recognition systems
- text to speech synthesis
- broadcast news
- speech recognizer
- noisy environments
- automatic speech recognition
- speech signal
- multi modal