Login / Signup

ViCLEVR: a visual reasoning dataset and hybrid multimodal fusion model for visual question answering in Vietnamese.

Khiem Vinh TranHao Phu PhanKiet Van NguyenNgan Luu-Thuy Nguyen
Published in: Multim. Syst. (2024)
Keyphrases