Also for k-means: more data does not imply better performance.
Marco LoogJesse H. KrijtheManuele BicegoPublished in: Mach. Learn. (2023)
Keyphrases
- data sets
- synthetic data
- raw data
- data sources
- data points
- data processing
- original data
- data analysis
- missing data
- image data
- data mining techniques
- complex data
- database
- experimental data
- statistical analysis
- decision trees
- neural network
- data collection
- input data
- knowledge discovery
- probabilistic model
- high quality
- data acquisition
- statistical methods
- data mining
- data quality