Clustering lines in high-dimensional space: Classification of incomplete data.
Jie GaoMichael LangbergLeonard J. SchulmanPublished in: ACM Trans. Algorithms (2010)
Keyphrases
- incomplete data
- incomplete data sets
- missing values
- missing data
- multiple imputation
- classification accuracy
- learning bayesian networks
- bayes classifier
- em algorithm
- unsupervised learning
- bayesian networks
- decision trees
- clustering algorithm
- supervised classification
- pattern recognition
- machine learning algorithms
- clustering method
- feature extraction
- high dimensional
- feature selection
- irrelevant attributes
- feature space
- nearest neighbor classification
- attribute selection
- training set
- data points
- feature vectors
- density estimation
- classification rules
- classification algorithm
- nearest neighbor rule
- support vector machine
- model selection
- data sets