On Leakage in Machine Learning Pipelines.
Leonard SasseEliana Nicolaisen-SobeskyJuergen DukartSimon B. EickhoffMichael GötzSami HamdanVera KomeyerAbhijit KulkarniJuha LahnakoskiBradley C. LoveFederico RaimondoKaustubh R. PatilPublished in: CoRR (2023)
Keyphrases
- machine learning
- machine learning methods
- computer vision
- learning systems
- pattern recognition
- decision trees
- big data
- active learning
- learning algorithm
- computer science
- database
- knowledge discovery
- supervised learning
- text mining
- supervised machine learning
- machine learning approaches
- statistical learning
- inductive learning
- social media
- explanation based learning
- model selection
- text classification
- support vector machine
- natural language
- image sequences
- feature selection
- artificial intelligence
- databases
- data sets