Login / Signup
Combining OCR Outputs for Logical Document Structure Markup. Technical Background to the ACL 2012 Contributed Task.
Ulrich Schäfer
Benjamin Weitz
Published in:
Discoveries@ACL (2012)
Keyphrases
</>
document structure
xml documents
text summarization
relevant documents
structured documents
semantic information
document images
document representation
optical character recognition
machine learning
databases
information extraction
information retrieval systems
data fusion
hierarchical structures