On the Readiness of Scientific Data for a Fair and Transparent Use in Machine Learning.
Joan Giner-MiguelezAbel GómezJordi CabotPublished in: CoRR (2024)
Keyphrases
- scientific data
- machine learning
- data management
- life sciences
- data collection
- scientific data sets
- scientific databases
- massive amounts of data
- scientific information
- hierarchical data
- learning algorithm
- geographically distributed
- evaluation campaigns
- bitmap indices
- scientific data management
- scientific disciplines
- machine learning methods
- temporal relationships
- biological pathways
- molecular dynamics
- data warehouse
- database systems
- knowledge discovery
- natural language processing
- drug discovery
- high energy physics
- data sets
- information extraction
- data mining