Multimodal Integration of Human-Like Attention in Visual Question Answering.

Published in: CVPR Workshops (2023)

Keyphrases