Incorporate Web Search Technology to Solve Out-of-Vocabulary Words in Chinese Word Segmentation.
Wei QiaoMaosong SunPublished in: PACLIC (2009)
Keyphrases
- word segmentation
- out of vocabulary
- chinese word segmentation
- web search
- pos tagging
- n gram
- language specific
- language independent
- language model
- cross lingual
- search engine
- cross language information retrieval
- text classification
- language modeling
- web search engines
- word level
- named entity recognition
- machine learning
- machine translation
- part of speech
- document analysis
- parallel corpora
- probabilistic model
- web pages