A study of the impact of generative AI-based data augmentation on software metadata classification.
Tripti KumariChakali Sai CharanAyan DasPublished in: CoRR (2023)
Keyphrases
- data sets
- data analysis
- statistical analysis
- machine learning
- data collection
- digital libraries
- metadata
- data quality
- computer systems
- multimedia data
- artificial intelligence
- database
- training data
- pattern recognition
- training set
- xml documents
- classification accuracy
- unsupervised learning
- geo referenced
- receiver operating characteristic curves
- training samples
- generative model
- data processing
- input data
- supervised learning
- multi class
- support vector machine
- knowledge discovery
- data points
- probability distribution
- feature extraction
- decision trees
- databases