Probabilistic Noise Identification and Data Cleaning.
Jeremy KubicaAndrew W. MoorePublished in: ICDM (2003)
Keyphrases
- data cleaning
- data integration
- text classification
- record linkage
- outlier detection
- data quality
- database
- data processing
- data warehousing
- missing values
- data warehouse
- web usage mining
- bayesian networks
- missing data
- fraud detection
- information extraction
- data management
- low dimensional
- response time
- query evaluation
- information retrieval
- machine learning