Sequential Embedding Induced Text Clustering, a Non-parametric Bayesian Approach.
Tiehang Duan, Qi Lou, Sargur N. Srihari, Xiaohui Xie
Published in: CoRR (2018)
Keyphrases
- text clustering
- text mining
- document clustering
- hierarchical clustering
- clustering algorithm
- text data
- vector space
- text categorization
- k-means
- background knowledge
- text documents
- metric learning
- document representation
- user feedback
- self-organizing maps
- WordNet
- text collections
- collaborative filtering
- logic programs
- domain knowledge
- knowledge discovery
- data analysis
- semantic relations
- probability density function
- information extraction
- co-occurrence
- clustering method
- document collections