Data Cleaning and Machine Learning: A Systematic Literature Review.
Pierre-Olivier CôtéAmin NikanjamNafisa AhmedDmytro HumeniukFoutse KhomhPublished in: CoRR (2023)
Keyphrases
- literature review
- data cleaning
- machine learning
- text classification
- data integration
- outlier detection
- record linkage
- data quality
- information extraction
- case study
- data processing
- text mining
- data warehousing
- knowledge discovery
- feature selection
- missing values
- data warehouse
- decision trees
- fraud detection
- database
- active learning
- data analysis
- query evaluation
- data mining
- databases
- naive bayes
- web usage mining
- website
- search engine
- web data
- real world
- data sets