SciCat: A Curated Dataset of Scientific Software Repositories.
Addi Malviya-ThakurReed MilewiczLavinia PaganiniAhmed Samir Imam MahmoudAudris MockusPublished in: CoRR (2023)
Keyphrases
- software repositories
- address these issues
- source code
- open source projects
- software evolution
- data mining
- benchmark datasets
- defect prediction
- scientific databases
- software systems
- scientific data
- raw data
- artificial intelligence
- database
- software projects
- mining software repositories
- historical data
- synthetic datasets
- software design
- knowledge discovery
- database systems
- software artifacts
- version control
- source files