World of code: enabling a research workflow for mining and analyzing the universe of open source VCS data.
Yuxing Ma, Tapajit Dey, Chris Bogart, Sadika Amreen, Marat Valiev, Adam Tutko, David Kennard, Russell Zaretzki, Audris Mockus
Published in: Empir. Softw. Eng. (2021)
Keyphrases
- data sets
- open source
- data structure
- data collection
- raw data
- training data
- source code
- statistical analysis
- data mining methods
- data mining applications
- database
- data sources
- text mining
- interesting patterns
- knowledge discovery
- data processing
- access control
- missing data
- spatial data
- data analysis
- original data
- hidden knowledge
- exploratory analysis
- data quality
- input data
- data mining techniques
- image data
- probability distribution
- end users