Clustering by Authorship Within and Across Documents.
Efstathios StamatatosMichael TschuggnallBen VerhoevenWalter DaelemansGünther SpechtBenno SteinMartin PotthastPublished in: CLEF (Working Notes) (2016)
Keyphrases
- document clustering
- clustering algorithm
- information retrieval
- document collections
- clustering method
- text clustering
- document retrieval
- web documents
- k means
- relevant documents
- text documents
- hierarchical clustering
- topic discovery
- topic detection
- xml documents
- data points
- information retrieval systems
- information theoretic
- cluster analysis
- text mining
- spectral clustering
- unsupervised learning
- vector space model
- document representation
- cosine similarity
- database
- document clusters
- content similarity
- user queries