Evaluation of vector embedding models in clustering of text documents.
Tomasz WalkowiakMateusz GniewkowskiPublished in: RANLP (2019)
Keyphrases
- text documents
- document clustering
- text mining
- text clustering
- document classification
- keywords
- text data
- clustering algorithm
- text categorization
- wordnet
- text classification
- unsupervised learning
- text representation
- named entities
- data sets
- knowledge discovery
- probabilistic model
- feature vectors
- similarity measure
- clustering method
- topic models
- bag of words
- expert systems
- vector space
- artificial intelligence
- machine learning
- data mining
- databases