Similarity-Based Attribute Selection with n-Grams.
Alexander Ulanov, German Sapozhnikov, Georgy Shevlyakov, Nikolay Lyubomishchenko
Published in: MLDM Posters (2011)
Keyphrases
- n-gram
- attribute selection
- decision trees
- language model
- decision tree induction
- text classification
- variable length
- information gain
- classification models
- bag of words
- naive Bayes
- part of speech
- feature selection
- web documents
- character n-grams
- data sets
- inside-outside algorithm
- probability estimation
- text categorization
- artificial intelligence
- neural network