37 Million Compilations: Investigating Novice Programming Mistakes in Large-Scale Student Data.
Amjad AlTadmriNeil C. C. BrownPublished in: SIGCSE (2015)
Keyphrases
- data sets
- data analysis
- database
- raw data
- training data
- high quality
- data structure
- data processing
- data collection
- database systems
- data quality
- original data
- computer systems
- massive scale
- computer programming
- privacy preserving
- statistical analysis
- software engineering
- knowledge discovery
- probability distribution
- data sources