Intra-Sentential Speaking Rate Control in Neural Text-To-Speech for Automatic Dubbing.
Mayank SharmaYogesh VirkarMarcello FedericoRoberto Barra-ChicoteRobert EnyediPublished in: Interspeech (2021)
Keyphrases
- rate control
- text to speech
- video coding
- rate distortion
- picture quality
- bit rate
- packet loss rate
- video streaming
- visual quality
- rate control algorithm
- speech synthesis
- inter frame
- wavelet based image coding
- rate control scheme
- text to speech synthesis
- bit allocation
- video quality
- variable bit rate
- network architecture
- video codec
- coding method
- image quality
- motion estimation
- three dimensional
- step size
- human visual system
- particle swarm optimization
- cost function
- data structure