Text data acquisition for domain-specific language models.
Abhinav SethyPanayiotis G. GeorgiouShrikanth S. NarayananPublished in: EMNLP (2006)
Keyphrases
- data acquisition
- language model
- domain specific
- information retrieval
- language modeling
- n gram
- monitoring system
- document retrieval
- low cost
- high speed
- document level
- probabilistic model
- data analysis
- data processing
- data collection
- text retrieval
- retrieval model
- supervisory control
- speech recognition
- query expansion
- language modelling
- real time
- multiword
- test collection
- context sensitive
- document ranking
- relevance model
- translation model
- statistical language models
- text mining
- language models for information retrieval
- okapi bm
- pseudo relevance feedback
- keywords
- smoothing methods
- query terms
- vector space model
- feature selection
- document collections
- database
- improve retrieval effectiveness
- text documents