MuST-Cinema: a Speech-to-Subtitles corpus.
Alina Karakanta, Matteo Negri, Marco Turchi
Published in: LREC (2020)
Keyphrases
- spontaneous speech
- conversational speech
- speech recognition
- lexical features
- automatic speech recognition
- speech signal
- test set
- human machine interaction
- broadcast news
- manually annotated
- spanish language
- audio visual
- spoken language
- speaker identification
- speech processing
- recognition engine
- endpoint detection
- speaker recognition
- dialogue system
- open domain
- multi stream
- vocal tract
- text to speech
- spoken dialogue systems
- language acquisition
- tv broadcast
- tv series
- information extraction