LaTr: Layout-Aware Transformer for Scene-Text VQA

Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha
Published in: CVPR (2022)
Keyphrases
  • scene text
  • text detection
  • natural scene images
  • image database
  • video database
  • computer vision
  • complex background
  • scene images