The Multilingual TEDx Corpus for Speech Recognition and Translation.
Elizabeth Salesky, Matthew Wiesner, Jacob Bremerman, Roldano Cattoni, Matteo Negri, Marco Turchi, Douglas W. Oard, Matt Post
Published in: Interspeech (2021)
Keyphrases
- speech recognition
- parallel corpus
- machine translation system
- comparable corpora
- cross language information retrieval
- statistical machine translation
- language model
- parallel corpora
- cross lingual
- machine translation
- cross language
- automatic speech recognition
- hidden Markov models
- language modeling
- language independent
- query translation
- speech recognizer
- translation model
- pattern recognition
- speech processing
- noisy environments
- speech signal
- speech synthesis
- speaker identification
- speech recognition systems
- conversational speech
- speech retrieval
- source language
- speech recognition technology
- information retrieval
- target language
- speaker independent
- speech recognizers
- handwriting recognition
- word pairs
- text retrieval
- natural language processing
- image processing
- feature selection