MaskOCR: Scene Text Recognition with Masked Vision-Language Pre-training.
Pengyuan LyuChengquan ZhangShanshan LiuMeina QiaoYangliu XuLiang WuKun YaoJunyu HanErrui DingJingdong WangPublished in: Trans. Mach. Learn. Res. (2024)
Keyphrases
- scene text recognition
- language learning
- real time
- training samples
- computer software
- training phase
- vision system
- object recognition
- natural language
- image processing
- training set
- programming language
- online learning
- training examples
- test set
- support vector
- training process
- formal language
- computational vision
- case study
- language processing
- feedforward neural networks
- databases