Use of subword tokenization for domain generation algorithm classification.
Sea Ran Cleon LiewNgai-Fong LawPublished in: Cybersecur. (2023)
Keyphrases
- generation algorithm
- pattern recognition
- machine learning
- pattern classification
- classification accuracy
- classification systems
- classification models
- image classification
- feature vectors
- domain independent
- feature extraction
- feature space
- n gram
- supervised learning
- domain specific
- machine learning methods
- support vector machine
- classification algorithm
- decision rules
- benchmark datasets
- classification rules
- model selection
- preprocessing
- information retrieval
- classification scheme
- spoken document retrieval
- classification rate
- classification process
- data sets
- text categorization
- training samples
- feature set
- hidden markov models
- training set
- support vector
- feature selection
- data mining