An Approach for Spam E-mail Detection with Support Vector Machine and n-Gram Indexing.
Jongsub Moon, Taeshik Shon, Jung-Taek Seo, Jongho Kim, Jungwoo Seo
Published in: ISCIS (2004)
Keyphrases
- n-gram
- support vector machine
- language model
- bag of words
- spam e-mail
- language independent
- text classification
- language modelling
- language modeling
- variable length
- part of speech
- character n-grams
- training data
- information retrieval
- Viterbi algorithm
- hidden Markov models
- kNN
- feature vectors
- databases
- machine learning
- detection algorithm
- word segmentation
- neural network