ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference.

Mengcheng Lan Chaofeng Chen Yiping Ke Xinjiang Wang Litong Feng Wayne Zhang

Published in: CoRR (2024)

Keyphrases

language learning
computer vision
vision system
natural language
programming language
real time
semantic representations
conceptual graphs
bayesian networks
probabilistic inference
bayesian inference
machine learning
computational linguistics
specification language
database
relational databases
database systems
information retrieval
modeling language
probabilistic reasoning
inference process
structured representations