Character Variable Numeralization Based on Dimension Expanding and its Application on Text Classification.
Li-xun Xu, Xu Yu, Yong Wang, Yun-xia Feng
Published in: ICYCSEE (1) (2016)
Keyphrases
- text classification
- bag of words
- feature selection
- machine learning
- text data
- text mining
- text categorization
- document classification
- sentiment analysis
- naive Bayes
- n-gram
- multi-label
- text documents
- data cleaning
- semantic features
- artificial intelligence
- labeled data
- active learning
- Chinese characters
- data mining
- text classifiers
- data sets