A Comparison of Similarity Measures for Text Documents.
Shanmugasundaram HariharanRengaramanujam SrinivasanPublished in: J. Inf. Knowl. Manag. (2008)
Keyphrases
- text documents
- text mining
- similarity measure
- text categorization
- text classification
- keywords
- news articles
- document classification
- information extraction
- document clustering
- wordnet
- named entities
- tf idf
- topic models
- bag of words
- prior knowledge
- text data
- natural language processing
- image classification
- databases
- automatic text categorization
- unsupervised learning
- semantic similarity
- image retrieval
- multiscale
- artificial intelligence
- real world