Cross Modality Bias in Visual Question Answering: A Causal View With Possible Worlds VQA.
Ali Vosoughi, Shijian Deng, Songyang Zhang, Yapeng Tian, Chenliang Xu, Jiebo Luo
Published in: IEEE Trans. Multim. (2024)
Keyphrases
- question answering
- visual information
- information retrieval
- question classification
- natural language
- natural language processing
- visual features
- passage retrieval
- image database
- named entities
- information extraction
- video database
- syntactic information
- question answering systems
- answer extraction
- open domain question answering
- natural language questions
- low level
- bayesian networks
- relation extraction
- multi modal
- qa clef
- answer validation
- test set
- qa systems
- knowledge representation
- textual entailment recognition