Semi-supervised Classification of Malware Families Under Extreme Class Imbalance via Hierarchical Non-Negative Matrix Factorization with Automatic Model Selection.
Maksim Ekin ErenManish BhattaraiRobert J. JoyceEdward RaffCharles NicholasBoian S. AlexandrovPublished in: CoRR (2023)
Keyphrases
- negative matrix factorization
- class imbalance
- class distribution
- active learning
- matrix factorization
- unlabeled data
- sparse representation
- semi supervised
- cost sensitive
- principal component analysis
- document clustering
- semi supervised learning
- labeled data
- spectral clustering
- training data
- model selection
- data mining
- natural language processing
- text classification
- data points
- data analysis
- feature selection
- learning algorithm