Login / Signup
Figure and Figure Caption Extraction for Mixed Raster and Vector PDFs: Digitization of Astronomical Literature with OCR Features.
Jill P. Naiman
Peter K. G. Williams
Alyssa Goodman
Published in:
CoRR (2022)
Keyphrases
</>
feature vectors
text extraction
co occurrence
automatically extracted
post processing
classification accuracy
feature extraction
feature space
binary images
automatic extraction
low level
image features
generative model
character recognition
preprocessing
optical character recognition
similarity measure