Improved Unsupervised Name Discrimination with Very Wide Bigrams and Automatic Cluster Stopping.
Ted PedersenPublished in: CICLing (2009)
Keyphrases
- data driven
- unsupervised clustering
- semi automatic
- clustering algorithm
- unsupervised learning
- real time
- feature selection
- wide range
- n gram
- cluster analysis
- cluster validation
- part of speech
- hierarchical clustering
- data clustering
- fully automatic
- expectation maximization
- similarity measure
- case study
- machine learning