Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering.
Yang Liu, Guanbin Li, Liang Lin
Published in: IEEE Trans. Pattern Anal. Mach. Intell. (2023)
Keyphrases
- question answering
- cross modal
- multi modal
- perceptual information
- answering questions
- natural language processing
- visual data
- visual similarity
- information extraction
- natural language
- multimedia retrieval
- visual information
- information retrieval
- image retrieval
- named entities
- multimedia databases
- relational databases
- text classification
- machine learning
- video sequences