Creating and Using Ground Truth OCR Sample Data for Finnish Historical Newspapers and Journals.
Kimmo Kettunen, Jukka Kervinen, Mika Koistinen
Published in: DHN (2018)
Keyphrases
- ground truth
- data sets
- historical data
- data structure
- data collection
- raw data
- high quality
- data distribution
- data points
- database
- data processing
- data samples
- test data
- synthetic data
- prior knowledge
- image data
- ground truth data
- decision trees
- human subjects
- data quality
- training data
- data analysis
- data mining techniques
- preprocessing
- experimental data
- sensor data
- data objects
- high dimensional
- statistical analysis
- database systems
- computer systems