Sign in

VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning.

Qiu-Shi ZhuLong ZhouZiqiang ZhangShujie LiuBinxing JiaoJie ZhangLirong DaiDaxin JiangJinyu LiFuru Wei
Published in: CoRR (2022)
Keyphrases