Fast parallel outlier detection for categorical datasets using MapReduce.
Anna KoufakouJimmy SecretanJohn ReederKelvin CardonaMichael GeorgiopoulosPublished in: IJCNN (2008)
Keyphrases
- outlier detection
- high dimensional datasets
- distance based outlier detection
- knowledge discovery
- detection algorithm
- categorical data
- density ratio estimation
- data mining
- detect outliers
- detecting outliers
- fraud detection
- density estimation
- data sets
- data streams
- credit card fraud detection
- database
- high dimensional data
- cloud computing
- least squares
- high dimensionality
- test data
- attribute values
- density based clustering