Lithuanian news clustering using document embeddings.
Lukas StankeviciusMantas LukoseviciusPublished in: IVUS (2019)
Keyphrases
- document clustering
- clustering algorithm
- topic detection
- tolerance rough set
- keywords
- clustering method
- k means
- text clustering
- information retrieval
- information retrieval systems
- unsupervised learning
- hierarchical clustering
- cluster analysis
- text mining
- structured documents
- categorical data
- document classification
- text documents
- retrieval systems
- low dimensional
- web documents
- data points
- cluster membership
- topic discovery
- topic detection and tracking
- high dimensional data
- self organizing maps
- spectral clustering
- data clustering
- news stories
- cosine similarity
- subspace clustering
- keyphrase extraction
- document clusters
- news articles