Multimodal Visual-Semantic Representations Learning for Scene Text Recognition.
Xinjian Gao
Ye Pang
Yuyu Liu
Maokun Han
Jun Yu
Wei Wang
Yuanxu Chen
Published in:
ACM Trans. Multim. Comput. Commun. Appl. (2024)
Keyphrases
semantic representations
knowledge base
multimedia
information extraction
multimodal
learning tasks
scene text recognition