Methods on Identifying Data Anomalies in a Normalized Database.
Tennyson X. ChenMartin D. MeyerNanthini GanapathiSean Shuangquan LiuJon CirellaPublished in: SEDE (2010)
Keyphrases
- database
- data sets
- high dimensional data
- statistical methods
- human experts
- data mining methods
- stored data
- data analysis
- noisy data
- spectral clustering
- missing values
- data points
- image data
- data quality
- end users
- raw data
- data cleaning
- missing data
- high quality
- data collection
- machine learning methods
- data objects
- data representations
- data mining applications
- complex structures
- multiple databases
- statistical databases
- data reduction
- database queries
- statistical tests
- database systems
- training data
- statistical analysis
- data sources
- preprocessing