Login / Signup
Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA.
Yongxin Zhu
Zhen Liu
Yukang Liang
Xin Li
Hao Liu
Changcun Bao
Linli Xu
Published in:
CoRR (2023)
Keyphrases
</>
bounding box
text regions
computer vision
image database
vision system
object segmentation
test images
object categories
object classes
object recognition
particle filtering
image processing
target object