Distribution of content words and phrases in text and language modelling.
Slava M. KatzPublished in: Nat. Lang. Eng. (1996)
Keyphrases
- language modelling
- n gram
- multiword
- language model
- keywords
- word pairs
- related documents
- web documents
- noun phrases
- text documents
- ad hoc retrieval
- tf idf
- semantic content
- information retrieval
- language modeling
- text classification
- semantic information
- weighting scheme
- part of speech
- multimedia
- query expansion
- pseudo relevance feedback
- text retrieval
- retrieval model
- bag of words
- text mining
- document representation
- metadata
- bayesian networks
- digital libraries
- information extraction
- semantic similarity
- visual features