OpenViVQA: Task, dataset, and multimodal fusion models for visual question answering in Vietnamese.

Nghia Hieu Nguyen, Duong T. D. Vo, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen
Published in: Inf. Fusion (2023)
Keyphrases