Impact of Word Segmentation Errors on Automatic Chinese Text Classification.
Xi LuoWataru OhyamaTetsushi WakabayashiFumitaka KimuraPublished in: Document Analysis Systems (2012)
Keyphrases
- text classification
- segmentation errors
- word segmentation
- n gram
- text categorization
- bag of words
- chinese text
- distributional clustering
- chinese word segmentation
- knn
- text mining
- feature selection
- text classifiers
- term frequency
- language modeling
- data mining
- co occurrence
- machine learning
- multi label
- sentiment analysis
- image processing
- text documents
- semantic features
- keywords
- unknown words
- keyword extraction
- computer vision
- feature space
- recognition rate