Practical approach to determine sample size for building logistic prediction models using high-throughput data.
Dae-Soon Son, DongHyuk Lee, Kyusang Lee, Sin-Ho Jung, TaeJin Ahn, Eunjin Lee, Insuk Sohn, Jongsuk Chung, Woong-Yang Park, Nam Huh, Jae Won Lee
Published in: J. Biomed. Informatics (2015)
Keyphrases
- sample size
- model selection
- prediction model
- statistical hypothesis testing
- covariance matrix
- probabilistic model
- training data
- small samples
- number of training samples
- VC dimension
- random sampling
- upper bound
- data sets
- linear models
- logistic regression
- predictive model
- accurate models
- high dimensional
- progressive sampling