ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference.
Mengcheng LanChaofeng ChenYiping KeXinjiang WangLitong FengWayne ZhangPublished in: CoRR (2024)
Keyphrases
- language learning
- computer vision
- vision system
- natural language
- programming language
- real time
- semantic representations
- conceptual graphs
- bayesian networks
- probabilistic inference
- bayesian inference
- machine learning
- computational linguistics
- specification language
- database
- relational databases
- database systems
- information retrieval
- modeling language
- probabilistic reasoning
- inference process
- structured representations