An ensemble approach for text document clustering using Wikipedia concepts.
Seyednaser NourashrafeddinEvangelos E. MiliosDirk V. ArnoldPublished in: ACM Symposium on Document Engineering (2014)
Keyphrases
- document clustering
- text documents
- text mining
- text clustering
- automatic categorization
- external knowledge
- document categorization
- clustering algorithm
- document corpus
- document collections
- topic detection
- clustering method
- tf idf
- document representation
- keywords
- information retrieval
- text data
- automatic summarization
- document clusters
- k means
- vector space model
- text analysis
- text retrieval
- information extraction
- data mining
- text collections
- text classification
- machine learning
- text categorization
- related documents
- document classification
- natural language processing
- artificial intelligence
- databases
- tolerance rough set