On the Cost of Mining Very Large Open Source Repositories.
Sean BanerjeeBojan CukicPublished in: BIGDSE@ICSE (2015)
Keyphrases
- open source
- source code
- software repositories
- mining software repositories
- open source projects
- open source software
- open source software projects
- knowledge discovery
- total cost
- mining algorithm
- fit in main memory
- software projects
- database
- expected cost
- minimum cost
- cost sensitive
- sequential patterns
- software evolution
- frequent itemsets
- software systems
- itemsets
- digital libraries
- data repositories
- map reduce
- pattern mining
- data mining applications
- web mining
- association rule mining
- text mining
- software engineering
- high level
- case study
- databases
- data sets