Incremental Learning for Fully Unsupervised Word Segmentation Using Penalized Likelihood and Model Selection.
Ruey-Cheng ChenPublished in: CoRR (2016)
Keyphrases
- incremental learning
- model selection
- word segmentation
- fully unsupervised
- penalized likelihood
- maximum likelihood
- em algorithm
- log likelihood
- cross validation
- n gram
- parameter estimation
- sample size
- text classification
- mixture model
- machine learning
- language independent
- document analysis
- feature selection
- expectation maximization
- supervised learning
- unsupervised learning
- learning process
- language modeling
- semi supervised
- density estimation
- model selection criteria