An investigation of byte n-gram features for malware classification.
Edward Raff, Richard Zak, Russell Cox, Jared Sylvester, Paul Yacci, Rebecca Ward, Anna Tracy, Mark McLean, Charles Nicholas
Published in: J. Comput. Virol. Hacking Tech. (2018)
Keyphrases
- n-gram
- text classification
- feature vectors
- classification accuracy
- feature set
- feature extraction
- classification method
- language model
- feature space
- class labels
- SVM classifier
- rich set
- bag of words
- word segmentation
- language independent
- training set
- language modelling
- variable length
- co-occurrence
- image classification
- text mining
- support vector
- part of speech
- inside-outside algorithm
- machine learning
- language modeling
- information retrieval systems
- decision trees
- search engine
- artificial intelligence
- information retrieval