Sign in

Multi-level, multi-modal interactions for visual question answering over text in images.

Jincai ChenSheng ZhangJiangfeng ZengFuhao ZouYuan-Fang LiTao LiuPing Lu
Published in: World Wide Web (2022)
Keyphrases