On the Accuracy and Scalability of Probabilistic Data Linkage Over the Brazilian 114 Million Cohort.
Robespierre PitaClícia PintoSamila SenaRosemeire FiacconeLeila AmorimSandra ReisMauricio BarretoSpiros C. DenaxasMarcos E. BarretoPublished in: IEEE J. Biomed. Health Informatics (2018)
Keyphrases
- data collection
- data sets
- data points
- synthetic data
- image data
- data sources
- data reduction
- original data
- sensor data
- high accuracy
- prior knowledge
- data analysis
- high quality
- databases
- relational databases
- probabilistic model
- classification accuracy
- data structure
- data mining techniques
- data processing
- training data
- error rate
- search engine
- data mining
- data distribution
- database