ActiveClean: Generating Line-Level Vulnerability Data via Active Learning.
Ashwin Kallingal JoshyMirza Sanjida AlamShaila SharminQi LiWei LePublished in: CoRR (2023)
Keyphrases
- active learning
- data sets
- end users
- database
- data processing
- website
- data analysis
- raw data
- experimental data
- image data
- complex data
- data objects
- synthetic data
- high dimensional data
- data collection
- knowledge discovery
- data sources
- training data
- data mining techniques
- input data
- semi supervised
- statistical analysis
- probability distribution
- missing data
- learning strategies
- prior knowledge
- data distribution
- association rules
- noisy data
- decision trees