Login / Signup
Speech2Video: Cross-Modal Distillation for Speech to Video Generation.
Shijing Si
Jianzong Wang
Xiaoyang Qu
Ning Cheng
Wenqi Wei
Xinghua Zhu
Jing Xiao
Published in:
Interspeech (2021)
Keyphrases
</>
cross modal
video streams
video sequences
video data
video content
multimedia
multi modal
video frames
video retrieval
key frames
spatio temporal
space time
activity recognition
e learning
human actions
video analysis
visual data
semantic concepts