Sign in

ViCLEVR: A Visual Reasoning Dataset and Hybrid Multimodal Fusion Model for Visual Question Answering in Vietnamese.

Khiem Vinh TranHao Phu PhanKiet Van NguyenNgan Luu-Thuy Nguyen
Published in: CoRR (2023)
Keyphrases