The BigKClustering approach for document clustering using Hadoop MapReduce.
Sofia MegarchiotiBasilis MamalisPublished in: PCI (2018)
Keyphrases
- document clustering
- mapreduce framework
- cloud computing
- map reduce
- data analytics
- clustering method
- clustering algorithm
- document collections
- text mining
- text documents
- document representation
- negative matrix factorization
- vector space model
- topic extraction
- document clusters
- large scale data sets
- cluster analysis
- real world
- action recognition
- k means
- knowledge base