Multi-CLIP: Contrastive Vision-Language Pre-training for Question Answering tasks in 3D Scenes.

Alexandros Delitzas, Maria Parelli, Nikolas Hars, Georgios Vlassis, Sotiris Anagnostidis, Gregor Bachmann, Thomas Hofmann
Published in: CoRR (2023)