Masked Vision-Language Transformers for Scene Text Recognition.
Jie WuYing PengShengming ZhangWeigang QiJian ZhangPublished in: CoRR (2022)
Keyphrases
- scene text recognition
- computer vision
- programming language
- natural language
- object recognition
- real time
- language learning
- vision system
- data sets
- programming environment
- object oriented
- general purpose
- multiscale
- image sequences
- website
- language processing
- image processing
- specification language
- english language
- formal language
- databases