Token replacement-based data augmentation methods for hate speech detection.
Kosisochukwu Judith Madukwe, Xiaoying Gao, Bing Xue
Published in: World Wide Web (2022)
Keyphrases
- data sets
- data collection
- training data
- data analysis
- statistical methods
- data mining techniques
- significant improvement
- data mining methods
- human experts
- high dimensional data
- data points
- missing values
- data processing
- multimedia data
- missing data
- multiple sources
- synthetic data
- statistical analysis
- image data
- probability distribution
- computational cost
- database
- experimental data
- preprocessing
- data structure
- databases
- audio stream