Streaming Parrotron for on-device speech-to-speech conversion.
Oleg Rybakov, Fadi Biadsy, Xia Zhang, Liyang Jiang, Phoenix Meadowlark, Shivani Agrawal
Published in: INTERSPEECH (2023)
Keyphrases
- speech recognition
- speech signal
- speech synthesis
- speaker recognition
- text to speech
- audio visual
- automatic speech recognition
- emotion recognition
- spoken language
- endpoint detection
- recognition engine
- english text
- speech processing
- automatic speech recognition systems
- genetic algorithm
- vocal tract
- multi lingual
- broadcast news
- human computer interaction