Scaled Log Likelihood Ratios for the Detection of Abbreviations in Text Corpora.
Tibor KissJan StrunkPublished in: COLING (2002)
Keyphrases
- log likelihood
- text corpora
- density estimation
- maximum likelihood
- information theoretic
- text analysis
- text mining
- scoring function
- topic models
- em algorithm
- detection algorithm
- text documents
- search engine
- concept hierarchy
- computational linguistics
- artificial intelligence
- data mining
- probabilistic model
- closed form
- feature vectors
- topic modeling
- digital libraries
- learning algorithm