ReClean: Reinforcement Learning for Automated Data Cleaning in ML Pipelines.
Mohamed AbdelaalAnil Bora YayakKai KledeHarald SchöningPublished in: ICDEW (2024)
Keyphrases
- data cleaning
- reinforcement learning
- data integration
- data quality
- text classification
- outlier detection
- record linkage
- database
- data processing
- data warehousing
- data warehouse
- information extraction
- missing values
- machine learning
- web usage mining
- fraud detection
- website
- multi class
- data extraction
- detection algorithm
- integrity constraints
- information retrieval
- databases