A Byte Sequence is Worth an Image: CNN for File Fragment Classification Using Bit Shift and n-Gram Embeddings.
Wenyang LiuYi WangKejun WuKim-Hui YapLap-Pui ChauPublished in: AICAS (2023)
Keyphrases
- n gram
- image classification
- text classification
- image retrieval
- image representation
- classification accuracy
- viterbi algorithm
- language model
- bag of words
- decision trees
- feature vectors
- cellular neural networks
- feature extraction
- language modelling
- pseudorandom
- variable length
- binary codes
- word segmentation
- language independent
- knowledge representation
- hidden markov models
- convolutional neural network