World of code: an infrastructure for mining the universe of open source VCS data.
Yuxing MaChris BogartSadika AmreenRussell ZaretzkiAudris MockusPublished in: MSR (2019)
Keyphrases
- data sets
- data collection
- open source
- complex data
- data processing
- raw data
- database
- interesting patterns
- high quality
- prior knowledge
- synthetic data
- transactional data
- data mining applications
- original data
- data distribution
- source code
- data analysis
- statistical analysis
- data structure
- sensor data
- input data
- data mining techniques
- data quality
- training data
- image data
- knowledge discovery