Chinese word segmentation based on conditional random fields with character clustering.
Liping Du, Xiaoge Li, Chunli Liu, Rui Liu, Xian Fan, Jianing Yang, Dayi Lin, Mian Wei
Published in: IALP (2016)
Keyphrases
- conditional random fields
- web page prediction
- Chinese word segmentation
- probabilistic model
- hidden Markov models
- graphical models
- sequence labeling
- prediction accuracy
- generative model
- higher order
- clustering algorithm
- Markov random field
- information extraction
- word segmentation
- pairwise
- maximum entropy
- named entity recognition
- document clustering
- CRF model
- computer vision
- k-means
- unsupervised learning
- maximum likelihood
- segmentation method
- prefetching