Detect2Interact: Localizing Object Key Field in Visual Question Answering with LLMs.
Jialou WangManli ZhuYulei LiHonglei LiLongzhi YangWai Lok WooPublished in: IEEE Intell. Syst. (2024)
Keyphrases
- question answering
- question classification
- natural language processing
- passage retrieval
- information retrieval
- natural language
- information extraction
- named entities
- visual information
- cross language
- natural language questions
- qa clef
- syntactic information
- sentence retrieval
- open domain question answering
- document retrieval
- low level
- visual data
- relation extraction
- visual features
- co occurrence
- answer validation