Statistical data cleaning for deep learning of automation tasks from demonstrations.
Caleb Chuck, Michael Laskey, Sanjay Krishnan, Ruta Joshi, Roy Fox, Ken Goldberg
Published in: CASE (2017)
Keyphrases
- deep learning
- data cleaning
- data integration
- outlier detection
- record linkage
- data processing
- text classification
- data quality
- database
- unsupervised learning
- machine learning
- integrity constraints
- data warehousing
- databases
- fraud detection
- data warehouse
- missing values
- weakly supervised
- domain specific
- training data