Using self-supervised word segmentation in Chinese information retrieval.
Fuchun Peng, Xiangji Huang, Dale Schuurmans, Nick Cercone, Stephen E. Robertson
Published in: SIGIR (2002)
Keyphrases
- word segmentation
- information retrieval
- language modeling
- Chinese text
- n-gram
- language model
- handwriting recognition
- word recognition
- information retrieval systems
- document analysis
- text classification
- query expansion
- Chinese word segmentation
- language independent
- search engine
- retrieval model
- document collections
- POS tagging
- cross-lingual
- information extraction
- unknown words
- text mining
- statistical language modeling
- Chinese text retrieval
- document retrieval
- information access
- sparse data
- knowledge discovery
- digital libraries
- image segmentation