Login / Signup
MAVD: The First Open Large-Scale Mandarin Audio-Visual Dataset with Depth Information.
Jianrong Wang
Yuchen Huo
Li Liu
Tianyi Xu
Qi Li
Sen Li
Published in:
CoRR (2023)
Keyphrases
</>
depth information
audio visual
emotion recognition
multi modal
depth map
rgbd images
stereo vision
visual information
depth recovery
visual data
multimedia
depth images
multi stream
audio visual speech recognition
real time
machine learning
multi view
high quality
low resolution
low level
computer vision