MuST-Cinema: a Speech-to-Subtitles corpus.
Alina KarakantaMatteo NegriMarco TurchiPublished in: CoRR (2020)
Keyphrases
- spontaneous speech
- speech recognition
- conversational speech
- lexical features
- spoken language
- speech signal
- automatic speech recognition
- speech synthesis
- text to speech
- spoken dialog
- recognition engine
- manually annotated
- open domain
- supervised machine learning
- spanish language
- vocal tract
- speech processing
- audio visual
- multi modal
- multimodal interfaces
- human machine interaction
- spoken document retrieval
- speaker identification
- linguistic features
- text data
- test set
- error rate
- non stationary