Vocoder-Based Speech Synthesis from Silent Videos

Daniel Michelsanti, Olga Slizovskaia, Gloria Haro, Emilia Gómez, Zheng-Hua Tan, Jesper Jensen

Published in: CoRR (2020)

Keyphrases

speech synthesis
speech recognition
text-to-speech
prosodic features
video sequences
vocal tract
video frames
speech corpus
pattern recognition
video data
visual analysis
video surveillance
video clips
human activities
user-generated
image sequences
neural network
video database
video analysis
real-time
multi-view
image compression
spatio-temporal