Login / Signup
StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training.
Yuechen Yu
Yulin Li
Chengquan Zhang
Xiaoqiang Zhang
Zengyuan Guo
Xiameng Qin
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
Published in:
CoRR (2023)
Keyphrases
</>
document images
document image analysis
document analysis
visual information
document processing
scanned documents
document image understanding
visual features
language identification
page layout
multimedia
page segmentation
word level
printed text
line extraction
optical character recognition
object recognition