PENTACET data - 23 Million Contextual Code Comments and 250,000 SATD comments.
Murali SridharanLeevi RantalaMika MäntyläPublished in: MSR (2023)
Keyphrases
- data analysis
- data sets
- small number
- high quality
- database
- synthetic data
- missing data
- statistical analysis
- computer systems
- training data
- data objects
- data structure
- original data
- data sources
- raw data
- data points
- knowledge discovery
- image data
- data distribution
- data collection
- search engine
- data quality
- sensor data
- spatial data
- context sensitive
- program code
- high dimensional data
- bit rate
- data processing
- software development
- probability distribution
- mobile devices
- website
- real world
- neural network
- databases