Document Classification through Building Specified N-Gram.
Byeong-Kyu Ko, Dongjin Choi, Chang Choi, Junho Choi, Pankoo Kim
Published in: IMIS (2012)
Keyphrases
- n-gram
- document classification
- text classification
- web documents
- bag of words
- language model
- text categorization
- text mining
- text documents
- language modeling
- variable length
- feature selection
- text classifiers
- word segmentation
- classification algorithm
- probabilistic model
- training set
- labeled data
- machine learning
- character n-grams
- retrieval model
- neural network
- naive Bayes
- co-occurrence
- information extraction
- kNN
- training data
- artificial intelligence
- data mining