Enhanced cross-domain document clustering with a semantically enhanced text stemmer (SETS).
Ivan StankovDiman TodorovRossitza SetchiPublished in: Int. J. Knowl. Based Intell. Eng. Syst. (2013)
Keyphrases
- document clustering
- cross domain
- semantically enhanced
- document representation
- text documents
- text categorization
- text mining
- transfer learning
- text data
- text classification
- language model
- knowledge transfer
- vector space model
- keywords
- web documents
- document collections
- information extraction
- information retrieval
- clustering algorithm
- bag of words
- semantic information
- named entities
- clustering method
- target domain
- n gram
- topic models
- feature selection
- image classification
- vector space
- prior knowledge
- artificial intelligence
- databases
- data mining