ESPnet-ST IWSLT 2021 Offline Speech Translation System.
Hirofumi InagumaBrian YanSiddharth DalmiaPengcheng GuoJiatong ShiKevin DuhShinji WatanabePublished in: CoRR (2021)
Keyphrases
- broadcast news
- speech recognition
- machine translation
- speech signal
- language resources
- text to speech
- real time
- out of vocabulary
- audio visual
- recognition engine
- automatic speech recognition
- endpoint detection
- multi lingual
- spontaneous speech
- speech synthesis
- fundamental frequency
- formant frequencies
- spoken language
- speaker recognition
- speaker identification
- statistical machine translation
- emotion recognition
- noisy environments
- cross language information retrieval
- pattern recognition
- case study
- neural network
- data sets