An Automated Python Script for Data Cleaning and Labeling using Machine Learning Technique.
Matthew Abiola OladipupoPrincewill Chima ObuzorBabatunde Joseph BamgbadeEmmanuel Abidemi AdeniyiKazeem M. OlagunjuSunday Adeola AjagbePublished in: Informatica (Slovenia) (2023)
Keyphrases
- data cleaning
- machine learning
- text classification
- data integration
- active learning
- record linkage
- outlier detection
- data quality
- information extraction
- data processing
- missing values
- fraud detection
- database
- data warehouse
- text mining
- data warehousing
- decision trees
- feature selection
- natural language processing
- knowledge discovery
- web usage mining
- learning algorithm
- databases
- support vector machine
- data analysis
- natural language
- real world
- data management
- missing data
- integrity constraints
- case study