An Ensemble Text Classification Model Combining Strong Rules and N-Gram.
Jinhong LiuYuliang LuPublished in: ICNC (3) (2007)
Keyphrases
- n gram
- language model
- character n grams
- word level
- language independent
- web documents
- language specific
- text classification
- language modeling
- information retrieval
- language modelling
- bag of words
- variable length
- part of speech
- text documents
- keywords
- viterbi algorithm
- inside outside algorithm
- neural network
- text retrieval
- training set
- word segmentation
- association rules
- artificial intelligence
- machine learning
- statistical language modeling
- information extraction
- document analysis
- feature selection
- databases