An Empirical Study of Multilingual Scene-Text Visual Question Answering.

Lin Li, Haohan Zhang, Zeqin Fang
Published in: NarSUM@MM (2023)