Semantically Enhanced Text Stemmer (SETS) for Document Clustering.
Ivan Stankov, Diman Todorov, Rossitza Setchi
Published in: KES (2012)
Keyphrases
- document clustering
- semantically enhanced
- document representation
- text documents
- text mining
- document corpus
- language model
- text data
- tf idf
- vector space model
- keywords
- information retrieval
- text classification
- web documents
- clustering method
- clustering algorithm
- semantic information
- document collections
- text categorization
- information extraction
- bag of words
- wordnet
- topic models
- knowledge discovery
- named entities
- machine learning
- databases
- document retrieval
- knowledge representation
- cross lingual
- k means
- high dimensional
- multimedia
- computer vision
- data mining