Sign in

OpenViVQA: Task, Dataset, and Multimodal Fusion Models for Visual Question Answering in Vietnamese.

Nghia Hieu NguyenDuong T. D. VoKiet Van NguyenNgan Luu-Thuy Nguyen
Published in: CoRR (2023)
Keyphrases