Cloud-based clustering of text documents using the GHSOM algorithm on the GridGain platform.
Martin Sarnovsky, Z. Ulbrik
Published in: SACI (2013)
Keyphrases
- k-means
- text documents
- text mining
- learning algorithm
- clustering method
- neural network
- expectation maximization
- hierarchical clustering
- wordnet
- unsupervised learning
- information extraction
- probabilistic model
- similarity measure
- artificial intelligence
- active learning
- prior knowledge
- expert systems
- clustering algorithm
- image processing
- high dimensional data
- topic models
- self-organizing maps
- knowledge base