Web N-gram workshop 2010.
Chengxiang ZhaiKuansan WangDavid YarowskyStephan VogelEvelyne ViegasPublished in: SIGIR Forum (2010)
Keyphrases
- n gram
- language model
- web documents
- variable length
- language independent
- web pages
- bag of words
- language modelling
- web mining
- text classification
- part of speech
- search click data
- language modeling
- information access
- viterbi algorithm
- semantic web
- query expansion
- search engine
- dynamic programming
- word segmentation
- digital libraries
- inside outside algorithm