Cross-modal Retrieval for Knowledge-based Visual Question Answering.
Paul LernerOlivier FerretCamille GuinaudeauPublished in: CoRR (2024)
Keyphrases
- cross modal
- question answering
- multi modal
- multimedia retrieval
- passage retrieval
- information retrieval
- image retrieval
- visual similarity
- natural language
- multimedia databases
- information extraction
- visual data
- natural language processing
- question answering systems
- named entities
- visual information
- machine learning
- answer extraction
- visual features
- relevance feedback
- multimedia data
- video search
- image database
- object recognition