World of Code: Enabling a Research Workflow for Mining and Analyzing the Universe of Open Source VCS data.
Yuxing MaTapajit DeyChris BogartSadika AmreenMarat ValievAdam TutkoDavid KennardRussell ZaretzkiAudris MockusPublished in: CoRR (2020)
Keyphrases
- process model
- open source
- data analysis
- data sets
- training data
- petri net
- data mining methods
- data mining techniques
- hidden knowledge
- data mining applications
- multimedia data
- data mining algorithms
- data collection
- knowledge discovery
- database
- image data
- statistical analysis
- synthetic data
- interesting patterns
- source code
- high quality
- data quality
- data sources
- exploratory analysis
- transactional data
- web logs
- historical data
- data structure
- raw data
- pattern mining
- end users
- data points
- access control
- data processing
- input data
- text mining