Clustering Unclustered Data: Unsupervised Binary Labeling of Two Datasets Having Different Class Balances
Marthinus Christoffel du PlessisMasashi SugiyamaPublished in: CoRR (2013)
Keyphrases
- data sets
- multidimensional data
- spectral clustering
- data processing
- database
- categorical data
- data sources
- original data
- clustering method
- binary vectors
- synthetic datasets
- raw data
- synthetic data
- data collection
- data structure
- unsupervised learning
- data mining techniques
- large scale data sets
- image data
- data points
- active learning
- data mining tasks
- data analysis
- decision trees
- categorical attributes
- high dimensional datasets
- gene expression profiles
- supervised classification
- high dimensional
- information theoretic
- multi class
- high dimensional data