Login / Signup
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation.
Rongjie Huang
Huadai Liu
Xize Cheng
Yi Ren
Linjun Li
Zhenhui Ye
Jinzheng He
Lichao Zhang
Jinglin Liu
Xiang Yin
Zhou Zhao
Published in:
CoRR (2023)
Keyphrases
</>
audio visual
multi modal
emotion recognition
visual information
multi stream
visual data
multimedia
audio features
speaker verification
audio visual speech recognition
image data
data sources
information extraction