Probabilistic ensemble learning for Vietnamese word segmentation.
Wuying Liu, Li Lin
Published in: SIGIR (2014)
Keyphrases
- ensemble learning
- word segmentation
- ensemble methods
- generalization ability
- n-gram
- random forest
- document analysis
- language modeling
- Bayesian networks
- base classifiers
- language independent
- text classification
- feature selection
- text categorization
- probabilistic model
- concept drift
- unlabeled data
- active learning
- high dimensional
- data sets
- named entity recognition
- decision trees
- support vector machine