Benchmark pour la classification de commentaires toxiques sur le jeu de données Civil Comments.
Corentin DucheneHenri JametPierre GuillaumeRéda DehakPublished in: EGC (2023)
Keyphrases
- classification accuracy
- pattern classification
- decision trees
- pattern recognition
- support vector
- classification rate
- feature extraction
- text classification
- supervised learning
- feature space
- case study
- feature selection
- training set
- high dimensional
- feature vectors
- classification scheme
- classification algorithm
- svm classifier
- support vector machine svm
- database
- supervised classification
- feature set
- knn
- preprocessing
- machine learning
- data mining
- real world
- data sets