A Clustering Algorithm of No-Word-Segmentation for Chinese Search Engine.
Deqing WangHui ZhangLiping ZhaoKe XiePublished in: SKG (2007)
Keyphrases
- word segmentation
- search engine
- clustering algorithm
- chinese text
- word recognition
- n gram
- web search
- language independent
- handwriting recognition
- text classification
- web search engines
- information retrieval
- chinese word segmentation
- k means
- chinese text retrieval
- cross lingual
- unknown words
- document clustering
- user queries
- web pages
- keywords
- word level
- pos tagging
- cluster analysis
- language modeling
- handwritten documents
- information access
- clustering method
- data analysis
- knn
- machine learning
- probabilistic model
- document analysis
- retrieval systems