Sharing and Reusing Data and Analytic Methods with LearnSphere.
Ran LiuKenneth R. KoedingerJohn C. StamperPhilip I. Pavlik Jr.Published in: EDM (2017)
Keyphrases
- data sets
- data mining methods
- statistical methods
- computer systems
- historical data
- computational cost
- databases
- data analysis
- preprocessing
- significant improvement
- data points
- original data
- missing values
- data mining techniques
- benchmark datasets
- data collection
- training data
- data reduction
- neural network
- spectral clustering
- experimental data
- high quality
- knowledge discovery
- data processing
- data sources
- data representations
- statistical significance
- complex structures
- decision trees
- multiple sources
- statistical tests
- learning models
- data mining applications
- machine learning
- data quality
- raw data
- data mining
- small number
- dimensionality reduction
- missing data
- input data
- synthetic data
- statistical analysis
- database