Non-parametric detection of meaningless distances in high dimensional data.
Ata KabánPublished in: Stat. Comput. (2012)
Keyphrases
- high dimensional data
- dimensionality reduction
- high dimensional
- low dimensional
- subspace clustering
- high dimensionality
- nearest neighbor
- high dimensions
- similarity search
- data points
- data sets
- original data
- high dimensional datasets
- dimension reduction
- input space
- clustering high dimensional data
- nonlinear dimensionality reduction
- distance measure
- data analysis
- data distribution
- manifold learning
- detection algorithm
- high dimensional spaces
- variable selection
- high dimensional data sets
- lower dimensional
- linear discriminant analysis
- euclidean distance
- subspace learning
- text data
- database
- small sample size
- sparse representation
- distance function