OpenViVQA: Task, Dataset, and Multimodal Fusion Models for Visual Question Answering in Vietnamese.
Nghia Hieu NguyenDuong T. D. VoKiet Van NguyenNgan Luu-Thuy NguyenPublished in: CoRR (2023)
Keyphrases
- question answering
- natural language processing
- information retrieval
- named entities
- question classification
- information extraction
- syntactic information
- question answering systems
- cross language
- natural language
- qa clef
- passage retrieval
- multimodal fusion
- natural language questions
- named entity recognition
- visual information
- visual features
- probabilistic model
- learning process
- high dimensional
- answering questions
- multimedia
- artificial intelligence