Cluster-clean-label: an interactive machine learning approach for labeling high-dimensional data.
David BeilAndreas TheisslerPublished in: VINCI (2020)
Keyphrases
- high dimensional data
- machine learning
- subspace clustering
- data points
- data analysis
- dimensionality reduction
- high dimensional
- variable weighting
- low dimensional
- high dimensionality
- labeling process
- high dimensions
- nearest neighbor
- data sets
- active learning
- similarity search
- clustering algorithm
- unsupervised learning
- input space
- clustering high dimensional data
- manifold learning
- high dimensional spaces
- original data
- subspace clusters
- data mining
- high dimensional datasets
- decision trees
- dimension reduction
- pattern recognition
- labeled data
- semi supervised learning
- model selection
- output space
- neural network
- knowledge discovery
- data clustering
- text data
- learning algorithm
- nonlinear dimensionality reduction
- database
- dimensional data
- linear discriminant analysis
- text classification
- text mining
- feature space
- computer vision
- multi label
- euclidean distance
- input data