Multi-granularity Text Representation and Transformer-Based Fusion Method for Visual Question Answering.
Xingang WangXiaoyu LiuXiaomin LiJinan CuiPublished in: CSCWD (2023)
Keyphrases
- question answering
- fusion method
- information fusion
- data fusion
- fusion methods
- information retrieval
- image fusion
- multi sensor
- information extraction
- natural language
- natural language processing
- visual information
- visual features
- text classification
- text retrieval
- text categorization
- low level
- high level
- vector space model
- document representation
- image processing