OpenViVQA: Task, Dataset, and Multimodal Fusion Models for Visual Question Answering in Vietnamese.

Published in: CoRR (2023)

Keyphrases