The Multilingual TEDx Corpus for Speech Recognition and Translation.
Elizabeth SaleskyMatthew WiesnerJacob BremermanRoldano CattoniMatteo NegriMarco TurchiDouglas W. OardMatt PostPublished in: CoRR (2021)
Keyphrases
- speech recognition
- parallel corpus
- machine translation system
- comparable corpora
- statistical machine translation
- cross language information retrieval
- language model
- cross lingual
- machine translation
- parallel corpora
- language modeling
- hidden markov models
- cross language
- speech synthesis
- query translation
- language independent
- automatic speech recognition
- noisy environments
- translation model
- speech signal
- pattern recognition
- speech recognition technology
- speech recognizer
- speech processing
- handwriting recognition
- probabilistic model
- speaker identification
- word pairs
- speech recognition systems
- source language
- neural network
- speaker independent
- speech recognizers
- isolated word
- target language
- question answering
- text classification
- information extraction
- machine learning