Keyphrases
- text clustering
- word level
- language independent
- text mining
- machine translation
- document clustering
- hierarchical clustering
- document images
- clustering algorithm
- n-gram
- background knowledge
- k-means
- text categorization
- text classification
- text data
- text documents
- user feedback
- information retrieval
- WordNet
- latent semantic analysis
- document representation
- vector space model
- information extraction
- natural language processing
- collaborative filtering
- text collections
- digital libraries
- named entities