Partitioned Gradient Matching-based Data Subset Selection for Compute-Efficient Robust ASR Training.
Ashish R. MittalDurga SivasubramanianRishabh K. IyerPreethi JyothiGanesh RamakrishnanPublished in: EMNLP (Findings) (2022)
Keyphrases
- data sets
- raw data
- data analysis
- subset selection
- training samples
- database
- machine learning
- synthetic data
- knowledge discovery
- data collection
- training data
- data processing
- spatial data
- small number
- data quality
- particle swarm optimization
- computer systems
- statistical analysis
- computationally efficient
- pattern matching
- data distribution
- original data