On the Efficacy of Co-Attention Transformer Layers in Visual Question Answering.
Ankur SikarwarGabriel KreimanPublished in: CoRR (2022)
Keyphrases
- question answering
- question classification
- information retrieval
- natural language processing
- natural language
- natural language questions
- information extraction
- named entities
- qa clef
- visual information
- passage retrieval
- syntactic information
- relation extraction
- cross language
- sentence retrieval
- question answering systems
- semantic roles
- machine learning
- low level
- answer validation
- candidate answers
- visual features
- multi modal