MuST-Cinema: a Speech-to-Subtitles corpus.

Alina Karakanta, Matteo Negri, Marco Turchi

Published in: CoRR (2020)

Keyphrases

spontaneous speech
speech recognition
conversational speech
lexical features
spoken language
speech signal
automatic speech recognition
speech synthesis
text to speech
spoken dialog
recognition engine
manually annotated
open domain
supervised machine learning
Spanish language
vocal tract
speech processing
audio-visual
multimodal
multimodal interfaces
human machine interaction
spoken document retrieval
speaker identification
linguistic features
text data
test set
error rate
non-stationary