Sign in

A multimodal attention fusion network with a dynamic vocabulary for TextVQA.

Jiajia WuJun DuFengren WangChen YangXinzhe JiangJinshui HuBing YinJianshu ZhangLirong Dai
Published in: Pattern Recognit. (2022)
Keyphrases