SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities.
Boyuan Chen, Zhuo Xu, Sean Kirmani, Brian Ichter, Danny Driess, Pete Florence, Dorsa Sadigh, Leonidas J. Guibas, Fei Xia
Published in: CoRR (2024)
Keyphrases
- language model
- spatial reasoning
- language modeling
- spatial relations
- n-gram
- speech recognition
- language modelling
- document retrieval
- computer vision
- probabilistic model
- query expansion
- temporal reasoning
- information retrieval
- retrieval model
- spatial knowledge
- statistical language models
- topological relations
- document ranking
- directional relations
- mixture model
- language models for information retrieval
- smoothing methods
- context sensitive
- translation model
- vector space model
- pseudo relevance feedback
- test collection
- qualitative spatial reasoning
- qualitative and quantitative
- spatial information
- search engine