TLex+: A Hybrid Method Using Conditional Random Fields and Dictionaries for Thai Word Segmentation.
Sarawoot KongyoungAnocha RugchatjaroenKrit KosawatPublished in: KICSS (2015)
Keyphrases
- word segmentation
- hybrid method
- n gram
- sparse representation
- feature space
- language modeling
- cross lingual
- chinese word segmentation
- hybrid algorithm
- chinese text retrieval
- chinese text
- support vector machine
- language independent
- document analysis
- named entity recognition
- text classification
- machine learning
- feature selection
- learning algorithm