Authorship Clustering using TF-IDF weighted Word-Embeddings.
Lucky AgarwalKartik ThakralGaurav BhattAnkush MittalPublished in: FIRE (2019)
Keyphrases
- tf idf
- document clustering
- term frequency
- term weighting
- weighting schemes
- stop words
- cosine similarity
- weighting scheme
- inverse document frequency
- clustering algorithm
- vector space model
- information retrieval
- retrieval model
- text categorization
- text documents
- document frequency
- vector space
- clustering method
- k means
- n gram
- ranking algorithm
- text mining
- document representation
- information retrieval systems
- keywords
- data points
- unsupervised learning
- high dimensional data
- knowledge discovery
- supervised learning
- topic models
- document collections
- feature selection
- data mining