Building Chinese Lexicons from Scratch by Unsupervised Short Document Self-Segmentation
Daniel Gayo-AvelloPublished in: CoRR (2004)
Keyphrases
- word segmentation
- image segmentation
- text segmentation
- text summarization
- keyword extraction
- level set
- medical images
- segmentation method
- bayesian image segmentation
- region growing
- segmentation algorithm
- fully unsupervised
- information retrieval
- unsupervised learning
- multiscale
- shape prior
- document analysis
- chinese word segmentation
- page segmentation
- document images
- document collections
- segmented images
- pos tagging
- natural language processing
- information retrieval systems
- semi supervised
- text documents
- image analysis
- segmentation of textured images
- unsupervised methods
- agglomerative clustering
- retrieval systems
- energy function