Methods for identifying 30 chronic conditions: application to administrative data.
Marcello TonelliNatasha WiebeMartin FortinBruce GuthrieBrenda R. HemmelgarnMatthew T. JamesScott W. KlarenbachRichard LewanczukBraden J. MannsPaul E. RonksleyPeter SargiousSharon E. StrausHude QuanPublished in: BMC Medical Informatics Decis. Mak. (2015)
Keyphrases
- data sets
- database
- synthetic data
- high quality
- statistical methods
- data structure
- data processing
- data collection
- noisy data
- missing data
- data sources
- significant improvement
- xml documents
- prior knowledge
- data analysis
- information systems
- training data
- data representations
- multiple sources
- data mining applications
- spectral clustering
- human experts
- learning algorithm
- missing values
- benchmark datasets
- statistical analysis
- end users
- small number
- data mining techniques
- preprocessing
- experimental data
- raw data
- original data
- semi supervised
- databases
- data mining methods
- image data
- learning models
- knowledge discovery
- incomplete data
- experimental conditions
- high dimensional data