A survey on unsupervised outlier detection in high-dimensional numerical data.
Arthur ZimekErich SchubertHans-Peter KriegelPublished in: Stat. Anal. Data Min. (2012)
Keyphrases
- outlier detection
- numerical data
- high dimensional
- high dimensional datasets
- categorical data
- fraud detection
- low dimensional
- detection algorithm
- knowledge discovery
- data streams
- data sets
- high dimensionality
- detecting outliers
- unsupervised learning
- semi supervised
- detect outliers
- density ratio estimation
- supervised learning
- numerical attributes
- data mining
- similarity search
- dimensionality reduction
- credit card fraud detection
- distance based outlier detection
- density estimation
- databases
- database
- computer vision
- multi dimensional
- nearest neighbor
- learning algorithm
- high dimensional data
- real world
- feature space
- principal component analysis
- probabilistic model