Keyphrases
- document clustering
- compound words
- tf idf
- text documents
- clustering algorithm
- text mining
- document collections
- clustering method
- vector space model
- term dependence
- k means
- noun phrases
- broadcast news
- automatic speech recognition
- machine learning
- cluster analysis
- information extraction
- text classification
- feature vectors
- similarity measure