ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection.
Thomas Hartvigsen, Saadia Gabriel, Hamid Palangi, Maarten Sap, Dipankar Ray, Ece Kamar
Published in: ACL (1) (2022)
Keyphrases
- detection method
- detection accuracy
- false alarms
- object detection
- detection algorithm
- real world
- noisy environments
- object detectors
- small scale
- false positives
- speech recognition
- benchmark datasets
- real life
- voice activity detection
- text recognition
- automatic speech recognition
- audio visual
- automatically generated
- synthetic datasets
- hidden markov models
- database
- spoken language
- text to speech
- audio stream
- endpoint detection
- detection rate