ZeroIn: Characterizing the Data Distributions of Commits in Software Repositories.
Kalyan PerumallaAradhana SoniRupam DeySteven RichPublished in: CoRR (2022)
Keyphrases
- data distribution
- software repositories
- address these issues
- source code
- raw data
- data streams
- software evolution
- index structure
- software systems
- data points
- historical data
- mining software repositories
- software projects
- software components
- concept drift
- high dimensional data
- open source
- data skew
- software design
- data processing
- multi dimensional
- software engineering
- nearest neighbor
- streaming data
- feature space
- real world
- neural network
- databases