N-grams based feature selection and text representation for Chinese Text Classification.
Zhihua Wei, Duoqian Miao, Jean-Hugues Chauchat, Rui Zhao, Wen Li
Published in: Int. J. Comput. Intell. Syst. (2009)
Keyphrases
- text classification
- n gram
- text representation
- feature selection
- text categorization
- bag of words
- text documents
- machine learning
- text mining
- classification accuracy
- text data
- labeled data
- language modeling
- knn
- term frequency
- part of speech
- unlabeled data
- semantic features
- information retrieval
- concept learning
- document retrieval
- keywords
- question answering
- neural network
- decision trees