A method and software framework for enriching private biomedical sources with data from public online repositories.
Alberto Anguita, Miguel García-Remesal, Norbert M. Graf, Victor Maojo
Published in: J. Biomed. Informatics (2016)
Keyphrases
- synthetic data
- data sources
- input data
- data sets
- data collection
- probabilistic model
- objective function
- computer systems
- noisy data
- statistical methods
- test data
- missing data
- preprocessing
- prior knowledge
- data analysis
- information loss
- data processing
- missing values
- heterogeneous data sources
- information sources
- main contribution
- database
- training data
- software systems
- clustering method
- raw data
- multiple sources
- source code
- large scale data sets