Lean GHTorrent: GitHub data on demand.
Georgios GousiosBogdan VasilescuAlexander SerebrenikAndy ZaidmanPublished in: MSR (2014)
Keyphrases
- data analysis
- data collection
- database
- data sets
- training data
- high quality
- data structure
- data distribution
- experimental data
- synthetic data
- historical data
- data quality
- original data
- test data
- application domains
- data sources
- machine learning
- statistical analysis
- computer systems
- data processing
- association rules
- raw data
- website
- complex data
- real time