Textless Speech-to-Speech Translation on Real Data.
Ann LeeHongyu GongPaul-Ambroise DuquenneHolger SchwenkPeng-Jen ChenChanghan WangSravya PopuriYossi AdiJuan PinoJiatao GuWei-Ning HsuPublished in: NAACL-HLT (2022)
Keyphrases
- speech recognition
- audio visual
- speech signal
- recognition engine
- spoken language
- automatic speech recognition systems
- automatic speech recognition
- text to speech
- speaker recognition
- endpoint detection
- speech synthesis
- speech processing
- audio signals
- pattern recognition
- spoken dialogue systems
- dialogue system
- multi lingual
- english text
- linear prediction
- emotion recognition
- human communication
- spoken document retrieval
- spontaneous speech
- synthetic data
- multi modal
- text to speech synthesis
- information retrieval systems