Hash-Grams: Faster N-Gram Features for Classification and Malware Detection.
Edward RaffCharles NicholasPublished in: DocEng (2018)
Keyphrases
- n gram
- text classification
- feature vectors
- language model
- classification accuracy
- malware detection
- feature set
- feature space
- feature extraction
- bag of words
- language modelling
- language independent
- language modeling
- machine learning
- variable length
- model selection
- support vector machine
- text mining
- part of speech
- information retrieval
- intrusion detection
- image classification
- information extraction
- word segmentation
- viterbi algorithm
- naive bayes
- neural network
- naive bayes classifier
- training set
- inside outside algorithm