Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History.
Yuto NishimuraYuki SaitoShinnosuke TakamichiKentaro TachibanaHiroshi SaruwatariPublished in: CoRR (2022)
Keyphrases
- end to end
- speech synthesis
- prosodic features
- speech recognition
- text to speech
- natural language
- dialogue system
- ad hoc networks
- wireless ad hoc networks
- language generation
- congestion control
- high bandwidth
- scalable video
- vocal tract
- speaker verification
- linguistic features
- real time
- application layer
- neural network
- content delivery
- admission control
- multipath
- web services
- information retrieval
- transport layer
- packet loss rate
- real world