OpenViVQA: Task, dataset, and multimodal fusion models for visual question answering in Vietnamese.

Published in: Inf. Fusion (2023)

Keyphrases