Improving Adversarial Data Collection by Supporting Annotators: Lessons from GAHD, a German Hate Speech Dataset.
Janis GoldzycherPaul RöttgerGerold SchneiderPublished in: NAACL-HLT (2024)
Keyphrases
- data collection
- data analysis
- speech recognition
- ground truth
- data sets
- collecting data
- speech synthesis
- automatic speech recognition
- lessons learned
- benchmark datasets
- interactive video
- text to speech
- database
- decision support
- multi agent
- multi modal
- supervised learning
- training dataset
- sensor networks
- spoken language
- broadcast news
- speaker recognition
- case study
- machine learning
- neural network
- endpoint detection
- ground truth labels