Login / Signup
Lip to Speech Synthesis with Visual Context Attentional GAN.
Minsu Kim
Joanna Hong
Yong Man Ro
Published in:
CoRR (2022)
Keyphrases
</>
speech synthesis
visual context
speech recognition
temporal context
text to speech
visual attention
visual scene
object detection
scene interpretation
semantic context
visual information
scene understanding