SAGA: A Scalable Framework for Optimizing Data Cleaning Pipelines for Machine Learning Applications.
Shafaq Siddiqi
Roman Kern
Matthias Boehm
Published in:
Proc. ACM Manag. Data (2023)
Keyphrases
machine learning
data cleaning
information extraction
data integration
feature selection
data model
knowledge discovery
text mining
decision support system
text classification
data processing
data quality
data extraction