Cleaning Dirty Books: Post-OCR Processing for Previously Scanned Texts.
Allen Kim
Charuta Pethe
Naoya Inoue
Steven Skiena
Published in:
EMNLP (Findings) (2021)
Keyphrases
document images
data processing
scanned documents
preprocessing
optical character recognition
data sets
real time
recognition errors
scanned images
text documents
information processing
post processing
databases
end to end
text processing
document processing
scientific papers
machine learning