Keyphrases
- logistic regression
- massive data
- rare events
- data mining applications
- importance sampling
- decision trees
- fraud detection
- support vector
- distributed data
- naive Bayes
- big data
- random forests
- loss function
- class distribution
- class imbalance
- feature selection
- classification accuracy
- high dimensional
- training set
- feature space