Data Leakage in Notebooks: Static Detection and Better Processes.
Chenyang Yang, Rachel A. Brower-Sinning, Grace A. Lewis, Christian Kästner
Published in: CoRR (2022)
Keyphrases
- data sets
- image data
- data collection
- training data
- data analysis
- data sources
- database
- small number
- complex data
- data processing
- high quality
- neural network
- input data
- synthetic data
- information systems
- knowledge discovery
- data points
- sensor networks
- data mining techniques
- probability distribution
- anomaly detection
- data acquisition