Multimodal Visual-Semantic Representations Learning for Scene Text Recognition.

Xinjian Gao, Ye Pang, Yuyu Liu, Maokun Han, Jun Yu, Wei Wang, Yuanxu Chen
Published in: ACM Trans. Multim. Comput. Commun. Appl. (2024)
Keyphrases
  • semantic representations
  • knowledge base
  • multimedia
  • information extraction
  • multimodal
  • learning tasks
  • scene text recognition