Unsupervised Lemmatization as Embeddings-Based Word Clustering.
Rudolf Rosa, Zdenek Zabokrtský
Published in: CoRR (2019)
Keyphrases
- unsupervised learning
- clustering algorithm
- unsupervised classification
- supervised classification
- unsupervised manner
- clustering method
- unsupervised clustering
- k-means
- completely unsupervised
- co-occurrence
- agglomerative clustering
- data sets
- cluster analysis
- unsupervised feature selection
- information bottleneck
- cluster validation
- high-dimensional data
- low-dimensional
- document clustering
- data clustering
- categorical data
- syntactic categories
- pointwise mutual information
- manifold learning
- hierarchical clustering
- spectral clustering
- self-organizing maps
- n-gram
- supervised learning
- semi-supervised
- keywords
- machine learning
- word sense disambiguation
- data points
- similarity measure