READ-BAD: A New Dataset and Evaluation Scheme for Baseline Detection in Archival Documents.
Tobias Gruning, Roger Labahn, Markus Diem, Florian Kleber, Stefan Fiel
Published in: DAS (2018)
Keyphrases
- detection scheme
- automatic detection
- information retrieval systems
- electronic documents
- document collections
- information retrieval
- database
- detection rate
- detection algorithm
- false positives
- benchmark datasets
- relevance judgments
- document retrieval
- text documents
- xml documents
- metadata
- detection method
- web documents
- document classification
- xml retrieval
- multi document summarization
- keywords
- rule interestingness measures
- object detection
- free text
- digital libraries