Read, Spell and Repeat: Scene Text Recognition with Vision-Language Circular Refinement.
Taiwei ZhangZhenghui HuWeixin LiQingjie LiuYunhong WangPublished in: ICASSP (2024)
Keyphrases
- scene text recognition
- programming language
- object recognition
- language learning
- computer vision
- image processing
- vision system
- data sets
- english language
- active vision
- hough transform
- general purpose
- knowledge representation
- real time
- multi agent
- visual perception
- object oriented programming
- specification language
- linguistic knowledge
- information retrieval
- data mining
- databases