Strategies for Language Model Web-Data Collection.
Vincent WanThomas HainPublished in: ICASSP (1) (2006)
Keyphrases
- language model
- data collection
- language modeling
- n gram
- document retrieval
- probabilistic model
- information retrieval
- speech recognition
- retrieval model
- test collection
- web pages
- statistical language models
- query expansion
- language modelling
- context sensitive
- web documents
- mixture model
- query terms
- web resources
- vector space model
- ad hoc information retrieval
- translation model
- data analysis
- machine learning
- word clouds
- relevance model
- smoothing methods
- language models for information retrieval
- language model for information retrieval
- statistical language modeling
- document length
- query specific
- statistical machine translation
- anchor text
- linked data
- question answering
- data streams