Mixture of Rationale: Multi-Modal Reasoning Mixture for Visual Question Answering.
Tao LiLinjun ShouXuejun LiuPublished in: CoRR (2024)
Keyphrases
- multi modal
- question answering
- cross modal
- information extraction
- audio visual
- multi modality
- passage retrieval
- information retrieval
- named entities
- natural language
- question classification
- cross language
- answering questions
- qa clef
- visual information
- natural language processing
- video search
- single modality
- semantic roles
- question answering systems
- syntactic information
- high dimensional
- knowledge base
- natural language questions
- uni modal
- multiple modalities
- visual data
- qa systems
- visual features
- feature extraction
- high level
- metadata
- feature selection