A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation.
Wen-Chin HuangBenjamin PeloquinJustine KaoChanghan WangHongyu GongElizabeth SaleskyYossi AdiAnn LeePeng-Jen ChenPublished in: CoRR (2023)
Keyphrases
- speech recognition
- language acquisition
- human communication
- spoken language
- endpoint detection
- speech synthesis
- audio visual
- speech signal
- text to speech
- automatic speech recognition
- pattern recognition
- hearing impaired
- speaker recognition
- quantitative evaluation
- noisy environments
- speaker identification
- tcp ip
- multi party
- evaluation method
- machine translation
- hand movements
- lightweight
- information retrieval