Sign in

Self-Supervised Audio-Visual Speech Representations Learning by Multimodal Self-Distillation.

Jing-Xuan ZhangGenshun WanZhen-Hua LingJia PanJianqing GaoCong Liu
Published in: ICASSP (2023)
Keyphrases