Exploring LLM-based Data Augmentation Techniques for Code Comment Quality Classification.
Priyam DalmiaPublished in: FIRE (Working Notes) (2023)
Keyphrases
- data sets
- high quality
- classification accuracy
- database
- pattern classification
- data quality
- experimental data
- data structure
- synthetic data
- data analysis
- feature space
- data reduction
- data distribution
- high dimensional data
- data collection
- machine learning
- training data
- pattern recognition
- support vector
- original data
- missing values
- feature extraction
- prior knowledge
- statistical analysis
- data processing
- small number
- image data
- data points
- preprocessing
- text classification
- missing data
- spatial data
- feature vectors
- face recognition
- learning algorithm
- low quality
- data sources