On Class Distributions Induced by Nearest Neighbor Graphs for Node Classification of Tabular Data.
Federico ErricaPublished in: NeurIPS (2023)
Keyphrases
- class distribution
- nearest neighbor
- tabular data
- class imbalance
- training set
- roc analysis
- decision boundary
- cost sensitive
- training samples
- highly skewed
- class labels
- classification accuracy
- imbalanced datasets
- knn
- text classification
- training data
- machine learning
- high dimensional
- supervised learning
- k nearest neighbor
- support vector machine
- misclassification costs
- classification algorithm
- decision trees
- feature extraction
- support vector
- feature vectors
- feature space
- feature selection
- small number
- roc curve
- base classifiers
- decision rules
- active learning
- high dimensional data
- text categorization
- principal component analysis
- multi class