Eleven quick tips for data cleaning and feature engineering.
Davide ChiccoLuca OnetoErica TavazziPublished in: PLoS Comput. Biol. (2022)
Keyphrases
- data cleaning
- feature engineering
- text classification
- dependency parsing
- labeled data
- machine learning
- feature selection
- outlier detection
- text mining
- data quality
- data integration
- database
- natural language processing
- information extraction
- record linkage
- knn
- website
- data processing
- learning process
- pairwise
- training data
- neural network
- databases
- data sets