Optimal subsample selection for massive logistic regression with distributed data.
Lulu Zuo, Haixiang Zhang, HaiYing Wang, Liuquan Sun
Published in: Comput. Stat. (2021)
Keyphrases
- logistic regression
- distributed data
- decision trees
- naive Bayes
- data sharing
- logistic regression models
- credit scoring
- support vector
- linear support vector machines
- loss function
- data distribution
- odds ratio
- data mining algorithms
- linear SVM
- active learning
- learning algorithm
- file system
- classification trees
- text classification
- similarity measure
- feature selection