Probabilistic Management of OCR Data using an RDBMS
Arun KumarChristopher RéPublished in: CoRR (2011)
Keyphrases
- data sets
- data processing
- original data
- data analysis
- synthetic data
- raw data
- data collection
- management system
- statistical analysis
- data mining techniques
- probability distribution
- data structure
- training data
- information systems
- data points
- small number
- data quality
- uncertain data
- high quality
- missing data
- stored data
- historical data
- data distribution
- experimental data
- data mining algorithms
- database management systems
- generative model
- input data
- prior knowledge
- learning algorithm
- data mining