RedPen: Region- and Reason-Annotated Dataset of Unnatural Speech.

Kyumin Park Keon Lee Daeyoung Kim Dongyeop Kang

Published in: CoRR (2022)

Keyphrases

benchmark datasets
database
input image
audio visual
automatic speech recognition
object recognition
annotated images
speech recognition
image set
endpoint detection
language acquisition
region segmentation
manually annotated
noisy environments
speech signal
region of interest
image regions
feature set
natural language