Domain Based Punjabi Text Document Clustering.
Saurabh SharmaVishal GuptaPublished in: COLING (Demos) (2012)
Keyphrases
- document clustering
- text documents
- text mining
- text clustering
- automatic categorization
- document corpus
- document categorization
- document representation
- clustering algorithm
- negative matrix factorization
- document collections
- clustering method
- tf idf
- topic extraction
- text analysis
- vector space model
- topic detection
- text collections
- document clusters
- keywords
- information retrieval
- k means
- document classification
- tolerance rough set
- text data
- text retrieval
- wordnet
- information extraction
- named entities
- text categorization
- text classification
- automatic summarization
- data analysis
- bag of words
- question answering
- web documents
- natural language processing
- similarity measure
- artificial intelligence