AVA-Speech: A Densely Labeled Dataset of Speech Activity in Movies.
Sourish ChaudhuriJoseph RothDaniel P. W. EllisAndrew C. GallagherLiat KaverRadhika MarvinCaroline PantofaruNathan RealeLoretta Guarino ReidKevin W. WilsonZhonghua XiPublished in: INTERSPEECH (2018)
Keyphrases
- speech recognition
- speech signal
- automatic speech recognition
- speech synthesis
- audio visual
- recognition engine
- manually labeled
- speaker recognition
- database
- endpoint detection
- broadcast news
- speaker identification
- spoken language
- speech recognizer
- pattern recognition
- training data
- learning algorithm
- object detection
- active learning
- training set
- vocal tract
- audio stream