Text Classification Improved through Automatically Extracted Sequences.
Dou ShenJian-Tao SunQiang YangHui ZhaoZheng ChenPublished in: ICDE (2006)
Keyphrases
- automatically extracted
- text classification
- bag of words
- feature selection
- manually created
- text mining
- text categorization
- naive bayes
- hidden markov models
- sentiment analysis
- machine learning
- n gram
- data cleaning
- visually similar
- domain specific
- databases
- labeled data
- text documents
- text data
- information retrieval
- feature reduction