Sign in

AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation.

Rongjie HuangHuadai LiuXize ChengYi RenLinjun LiZhenhui YeJinzheng HeLichao ZhangJinglin LiuXiang YinZhou Zhao
Published in: ACL (1) (2023)
Keyphrases
  • audio visual
  • multi modal
  • visual information
  • emotion recognition
  • multi stream
  • multimedia
  • visual data
  • audio features
  • person authentication
  • speaker verification