Mining for the Meanings of a Murder: The Impact of OCR Quality on the Use of Digitized Historical Newspapers.
Carolyn StrangeDaniel McNamaraJosh WodakIan WoodPublished in: Digit. Humanit. Q. (2014)
Keyphrases
- high quality
- optical character recognition
- data mining
- historical manuscripts
- error correction
- knowledge discovery
- text mining
- itemsets
- higher quality
- document images
- low quality
- sequential patterns
- mining algorithm
- web mining
- quality measures
- quality assessment
- post processing
- data mining techniques
- document analysis
- information systems