SVTS: Scalable Video-to-Speech Synthesis.

Rodrigo Mira, Alexandros Haliassos, Stavros Petridis, Björn W. Schuller, Maja Pantic

Published in: CoRR (2022)

Keyphrases

speech synthesis
scalable video
end to end
speech recognition
bitstream
text to speech
video transmission over wireless
video quality
vocal tract
video streaming
joint source and channel coding
scalable video coding
video transmission
bit rate
base layer
frame rate
coding scheme
computer vision
optical flow
feature extraction