Leakage and the reproducibility crisis in machine-learning-based science.
Sayash KapoorArvind NarayananPublished in: Patterns (2023)
Keyphrases
- machine learning
- computer science
- data mining
- artificial intelligence
- interdisciplinary field
- natural language processing
- machine learning algorithms
- pattern recognition
- learning algorithm
- machine learning methods
- feature selection
- learning tasks
- decision trees
- databases
- explanation based learning
- knowledge acquisition
- text classification
- learning systems
- machine learning approaches
- real time
- active learning
- neural network
- knowledge discovery
- reinforcement learning
- database
- crisis management
- earth science
- data sets
- computational biology
- learning problems
- genetic algorithm
- computer vision
- natural language
- semi supervised learning
- data analysis
- decision makers
- knowledge representation
- supervised learning
- information extraction