Improving Adversarial Data Collection by Supporting Annotators: Lessons from GAHD, a German Hate Speech Dataset.
Janis GoldzycherPaul RöttgerGerold SchneiderPublished in: CoRR (2024)
Keyphrases
- data collection
- speech recognition
- lessons learned
- ground truth
- benchmark datasets
- spoken language
- sensor networks
- data analysis
- recognition engine
- data entry
- wireless sensor networks
- collecting data
- speech signal
- synthetic datasets
- interactive video
- noisy environments
- language acquisition
- audio visual
- dialogue system
- databases
- learning environment
- neural network