A Design of Interface for Visual-Impaired People to Access Visual Information from Images Featuring Large Language Models and Visual Language Models.
Zhexin ZhangPublished in: CHI Extended Abstracts (2024)
Keyphrases
- language model
- visual information
- visual data
- visual features
- image collections
- content based image retrieval systems
- language modeling
- visual descriptors
- textual information
- visual input
- low level
- document retrieval
- visual content
- visual cues
- n gram
- probabilistic model
- information retrieval
- visual similarity
- image retrieval
- retrieval model
- eye movements
- image database
- visual scene
- image classification
- visual concepts
- image annotation
- visual and textual information
- smoothing methods
- language models for information retrieval
- audio visual
- image search
- test collection
- query expansion
- image data
- web images
- relevance model
- semantic information
- object recognition
- color histogram
- image features
- domain knowledge
- search engine