Multimodal Inverse Cloze Task for Knowledge-based Visual Question Answering.
Paul LernerOlivier FerretCamille GuinaudeauPublished in: CoRR (2023)
Keyphrases
- question answering
- information extraction
- question classification
- information retrieval
- named entities
- natural language processing
- visual information
- qa clef
- semantic roles
- multi modal
- cross language
- passage retrieval
- natural language
- syntactic information
- qa systems
- open domain question answering
- data mining
- relation extraction
- question answering systems
- candidate answers
- natural language questions
- document collections
- visual data
- visual features
- low level
- keywords
- textual entailment recognition