Improving Document Clustering Performance: The Use of an Automatically Generated Ontology to Augment Document Representations.
Stephen Bradshaw, Colm O'Riordan, Daragh Bradshaw
Published in: KDIR (2017)
Keyphrases
- automatically generated
- document clustering
- document representation
- text representation
- document collections
- vector space model
- document similarity
- text documents
- semantic information
- text mining
- clustering algorithm
- domain knowledge
- clustering method
- knowledge representation
- domain specific
- bag of words
- document retrieval
- cluster analysis
- data analysis
- artificial intelligence
- web documents
- k nearest neighbor
- search engine