Data Leakage in Notebooks: Static Detection and Better Processes.
Chenyang YangRachel A. Brower-SinningGrace A. LewisChristian KästnerPublished in: ASE (2022)
Keyphrases
- data sets
- database
- data analysis
- knowledge discovery
- image data
- data mining
- machine learning
- raw data
- data collection
- data processing
- input data
- small number
- synthetic data
- detection algorithm
- data mining techniques
- end users
- high quality
- decision trees
- data sources
- training data
- image sequences
- sensor data
- experimental data
- data acquisition
- network structure
- noisy data
- complex data