Sign in

CLIP-Guided Vision-Language Pre-training for Question Answering in 3D Scenes.

Maria ParelliAlexandros DelitzasNikolas HarsGeorgios VlassisSotiris AnagnostidisGregor BachmannThomas Hofmann
Published in: CoRR (2023)
Keyphrases