Chinese Text Classification without Automatic Word Segmentation.
Wei Liu, Ben Allison, David Guthrie, Louise Guthrie
Published in: ALPIT (2007)
Keyphrases
- word segmentation
- text classification
- n-gram
- Chinese text
- language independent
- word recognition
- Chinese word segmentation
- text categorization
- feature selection
- labeled data
- text documents
- language modeling
- bag of words
- machine learning
- sentiment analysis
- unlabeled data
- kNN
- POS tagging
- cross-lingual
- text mining
- word level
- Chinese text retrieval
- unknown words
- semi-supervised learning
- learning algorithm
- language model