Sign in

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model.

Yi-Jen ShihHsuan-Fu WangHeng-Jui ChangLayne BerryHung-yi LeeDavid Harwath
Published in: SLT (2022)
Keyphrases