Data Cleaning for XML Electronic Dictionaries via Statistical Anomaly Detection.
Michael BloodgoodBenjamin StraussPublished in: CoRR (2016)
Keyphrases
- anomaly detection
- data cleaning
- data integration
- intrusion detection
- data model
- detecting anomalies
- record linkage
- intrusion detection system
- anomalous behavior
- outlier detection
- data quality
- text classification
- databases
- data warehousing
- database
- data management
- xml documents
- data sources
- xml data
- fraud detection
- data processing
- missing values
- one class support vector machines
- metadata
- detect anomalies
- social networks
- training data
- website
- web usage mining
- machine learning
- integrity constraints
- semi structured
- multi class
- feature space
- structured data
- business intelligence
- query language
- object oriented