Sign in

MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition.

Xize ChengLinjun LiTao JinRongjie HuangWang LinZehan WangHuangdai LiuYe WangAoxiong YinZhou Zhao
Published in: CoRR (2023)
Keyphrases