Sign in

Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA.

Yongxin ZhuZhen LiuYukang LiangXin LiHao LiuChangcun BaoLinli Xu
Published in: CoRR (2023)
Keyphrases
  • bounding box
  • text regions
  • computer vision
  • image database
  • vision system
  • object segmentation
  • test images
  • object categories
  • object classes
  • object recognition
  • particle filtering
  • image processing
  • target object