Cross-Modal Generative Augmentation for Visual Question Answering.
Zixu Wang, Yishu Miao, Lucia Specia
Published in: BMVC (2021)
Keyphrases
- question answering
- cross modal
- multi modal
- image retrieval
- multimedia retrieval
- information retrieval
- natural language
- visual similarity
- natural language processing
- named entities
- visual data
- multimedia databases
- visual recognition
- information extraction
- generative model
- image representation
- image understanding
- knowledge base
- visual content