Frequency Estimates for Statistical Word Similarity Measures.
Egidio L. TerraCharles L. A. ClarkePublished in: HLT-NAACL (2003)
Keyphrases
- similarity measure
- confidence intervals
- statistical analysis
- information theoretic
- data driven
- mutual information
- low frequency
- statistical models
- feature vectors
- co occurrence
- data sets
- natural language
- similarity search
- wavelet transform
- similarity function
- semantic similarity
- natural language text
- statistical inference
- similarity metrics
- multiword
- linguistic information
- similarity assessment
- confidence bounds