Automatic Evaluation of Document Classification Using N-Gram Statistics.
Dongjin ChoiByeong-Kyu KoEunji LeeMyunggwon HwangPankoo KimPublished in: NBiS (2012)
Keyphrases
- test collection
- document classification
- n gram
- automatic evaluation
- language model
- text classification
- web documents
- language modeling
- text mining
- text categorization
- quality assessment
- text documents
- bag of words
- document collections
- classification algorithm
- machine learning
- naive bayes
- feature selection
- support vector machine
- web search engines
- databases
- labeled data
- knn
- decision trees
- human judgments
- data mining