Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History.
Yuto NishimuraYuki SaitoShinnosuke TakamichiKentaro TachibanaHiroshi SaruwatariPublished in: INTERSPEECH (2022)
Keyphrases
- end to end
- speech synthesis
- prosodic features
- speech recognition
- text to speech
- natural language
- dialogue system
- language generation
- high bandwidth
- ad hoc networks
- multipath
- wireless ad hoc networks
- real time
- admission control
- vocal tract
- scalable video
- congestion control
- content delivery
- application layer
- internet protocol
- pattern recognition
- video sequences