Document Image Collection Using Amazon's Mechanical Turk.
Audrey N. LeJerome AjotMark A. PrzybockiStephanie M. StrasselPublished in: Mturk@HLT-NAACL (2010)
Keyphrases
- document images
- mechanical turk
- document image analysis
- document analysis
- document image understanding
- document processing
- scanned documents
- page segmentation
- gold standard
- language identification
- optical character recognition
- word level
- text lines
- page layout
- historical documents
- digital libraries
- handwritten documents
- document layout
- ocr systems
- image processing
- machine vision
- line extraction
- printed documents
- information extraction
- detection method