Evaluating Better Document Representation in Clustering with Varying Complexity.
Stephen Bradshaw, Colm O'Riordan
Published in: KDIR (2018)
Keyphrases
- document representation
- document clustering
- bag of words
- clustering algorithm
- clustering method
- vector space model
- k-means
- language model
- document collections
- text mining
- text documents
- vector space
- data fusion
- document content
- semantic information
- web documents
- cluster analysis
- semantic relations
- unsupervised learning
- high level
- information retrieval