A Comparison of Machine Learning Methods for Extremely Unbalanced Industrial Quality Data.
Pedro José PereiraAdriana PereiraPaulo CortezAndré Luiz PilastriPublished in: EPIA (2021)
Keyphrases
- data sets
- high quality
- data quality
- raw data
- data distribution
- training data
- data processing
- data collection
- database
- statistical analysis
- original data
- experimental data
- data analysis
- input data
- image data
- high dimensional data
- big data
- knowledge discovery
- data acquisition
- information retrieval
- data points
- low quality
- prior knowledge
- quality improvement
- historical data
- data mining
- noisy data
- social networks
- application domains
- spatial data
- missing data
- data mining techniques
- small number