Conquering Language: Using NLP on a Massive Scale to Build High Dimensional Language Models from the Web.
Gregory GrefenstettePublished in: CICLing (2007)
Keyphrases
- language model
- massive scale
- high dimensional
- language modeling
- natural language
- language processing
- language modelling
- n gram
- speech recognition
- information retrieval
- retrieval model
- low latency
- document retrieval
- probabilistic model
- query terms
- context sensitive
- test collection
- text mining
- natural language processing
- statistical language models
- web documents
- query expansion
- information extraction
- language models for information retrieval
- low dimensional
- real time
- smoothing methods
- vector space model
- pseudo relevance feedback
- web pages
- similarity search
- machine translation
- bayesian networks
- question answering
- linked data
- operating system
- high speed
- feature space
- machine learning