What Does a Visual Formal Analysis of the World's 500 Most Famous Paintings Tell Us About Multimodal LLMs?
Muzi Tao, Saining Xie
Published in: Tiny Papers @ ICLR (2024)
Keyphrases
- formal analysis
- digital images
- visual information
- cross modal
- multi modal
- multimodal information
- low level
- visual features
- agent model
- visual cues
- formal methods
- high level
- multimodal interaction
- audio visual
- ban logic
- artificial intelligence
- visual perception
- wireless sensor networks
- data model
- case study
- knowledge base