Combining Distributed Word Representation and Document Distance for Short Text Document Clustering.
Supavit Kongwudhikunakorn, Kitsana Waiyamai
Published in: J. Inf. Process. Syst. (2020)
Keyphrases
- document clustering
- short text
- topic detection
- text documents
- document collections
- latent topics
- text mining
- document representation
- document clusters
- document similarity
- clustering algorithm
- tf-idf
- vector space model
- clustering method
- tolerance rough set
- k-means
- keywords
- term frequency
- data mining
- cluster analysis
- semantic information
- cross-lingual
- n-gram
- co-occurrence
- information extraction
- data analysis
- metadata
- information retrieval