Revisiting data complexity metrics based on morphology for overlap and imbalance: snapshot, new overlap number of balls metrics and singular problems prospect.
José Daniel Pascual-TrianaDavid CharteMarta Andrés ArroyoAlberto FernándezFrancisco HerreraPublished in: Knowl. Inf. Syst. (2021)
Keyphrases
- data sets
- small number
- data collection
- data sources
- database
- high quality
- statistical methods
- application domains
- knowledge discovery
- small fraction
- huge number
- synthetic data
- data processing
- training data
- data objects
- data mining applications
- image data
- multi class
- data points
- input data
- probability distribution
- prior knowledge
- data analysis
- computational complexity
- data structure
- raw data
- class imbalance
- highly correlated
- neural network