Classification of Distributed Data Using Topic Modeling and Maximum Variation Sampling.
Robert M. Patton, Justin M. Beaver, Thomas E. Potok
Published in: HICSS (2011)
Keyphrases
- distributed data
- topic modeling
- topic models
- text classification
- pattern recognition
- data sharing
- feature selection
- machine learning
- latent dirichlet allocation
- image classification
- decision trees
- support vector
- communication cost
- text mining
- document classification
- neural network
- data mining algorithms
- databases
- collaborative filtering
- data sources
- training set
- preprocessing