RedPen: Region- and Reason-Annotated Dataset of Unnatural Speech.
Kyumin ParkKeon LeeDaeyoung KimDongyeop KangPublished in: CoRR (2022)
Keyphrases
- benchmark datasets
- database
- input image
- audio visual
- automatic speech recognition
- object recognition
- annotated images
- speech recognition
- image set
- endpoint detection
- language acquisition
- region segmentation
- manually annotated
- noisy environments
- speech signal
- region of interest
- image regions
- feature set
- natural language