Paparazzi: A Deep Dive into the Capabilities of Language and Vision Models for Grounding Viewpoint Descriptions.
Henrik Voigt
Jan N. Hombeck
Monique Meuschke
Kai Lawonn
Sina Zarrieß
Published in:
CoRR (2023)
Keyphrases
viewpoint
computer vision
statistical models
programming language
probabilistic model
deep learning
language learning
experimental data
complex systems
3D objects
object recognition
natural language
image processing
real time
database
model selection
multi agent
data sets