A Divisive Information-Theoretic Feature Clustering Algorithm for Text Classification.
Inderjit S. DhillonSubramanyam MallelaRahul KumarPublished in: J. Mach. Learn. Res. (2003)
Keyphrases
- distributional clustering
- information theoretic
- text classification
- clustering algorithm
- information theory
- mutual information
- text categorization
- jensen shannon divergence
- theoretic framework
- information bottleneck
- multi modality
- log likelihood
- clustering method
- bag of words
- bregman divergences
- kullback leibler divergence
- feature selection
- k means
- machine learning
- kl divergence
- information theoretic measures