Anatomy of Hate Speech Datasets: Composition Analysis and Cross-dataset Classification.
Samuel S. Guimarães, Gabriel Kakizaki, Philipe F. Melo, Márcio Silva, Fabricio Murai, Julio C. S. Reis, Fabrício Benevenuto
Published in: HT (2023)
Keyphrases
- benchmark datasets
- uci datasets
- feature set
- statistical analysis
- data analysis
- classification accuracy
- uci repository
- uci machine learning repository
- pattern recognition
- training dataset
- classification method
- pattern classification
- feature vectors
- training set
- feature space
- three dimensional
- decision trees
- synthetic datasets
- supervised learning
- model selection
- text classification
- database
- support vector machine
- feature selection
- machine learning