Building gender-specific sexually transmitted infection risk prediction models using CatBoost algorithm and NHANES data.
Mengjie HuHan PengXuan ZhangLefeng WangJingjing RenPublished in: BMC Medical Informatics Decis. Mak. (2024)
Keyphrases
- easily interpretable
- input data
- optimal solution
- data sets
- noisy data
- experimental data
- probabilistic model
- learning algorithm
- data structure
- data reduction
- prior knowledge
- database
- historical data
- incomplete data
- detection algorithm
- data mining techniques
- data analysis
- np hard
- expectation maximization
- dynamic programming
- prior information
- preprocessing
- prediction error
- information loss
- computational complexity
- worst case
- data points
- surface meshes
- missing data
- synthetic datasets
- prediction model
- classification trees
- predictive model
- learned models
- k means
- prediction algorithm
- clustering method
- contingency tables
- parameter estimation
- similarity measure
- knowledge discovery
- statistical methods
- accurate models
- objective function
- data mining
- missing values
- high dimensional data
- xml documents
- segmentation algorithm
- training data
- data sources