Login / Signup
Audio-visual Recognition of Overlapped speech for the LRS2 dataset.
Jianwei Yu
Shi-Xiong Zhang
Jian Wu
Shahram Ghorbani
Bo Wu
Shiyin Kang
Shansong Liu
Xunying Liu
Helen Meng
Dong Yu
Published in:
CoRR (2020)
Keyphrases
</>
audio visual
digit recognition
multi modal
multi stream
visual information
emotion recognition
multimedia
object recognition
speaker verification
activity recognition
feature extraction
feature set
visual data
pattern recognition
audio features
data sets
keywords
three dimensional