Probabilistic correlation-based similarity measure on text records.
Shaoxu Song, Han Zhu, Lei Chen
Published in: Inf. Sci. (2014)
Keyphrases
- similarity measure
- probabilistic model
- database
- databases
- bayesian networks
- text documents
- free text
- context sensitive
- clustering method
- automatically extracted
- mutual information
- text mining
- similarity assessment
- natural language generation
- string matching
- textual data
- similarity metric
- text data
- measuring similarity
- edit distance
- data sets
- uncertain data
- semantic information
- similarity search
- text categorization
- generative model
- distance measure
- pairwise
- belief networks
- semantic similarity
- similarity matrix
- text information
- similarity computation
- high dimensional