End-to-End Multi-Modal Speech Recognition on an Air and Bone Conducted Speech Corpus.
Mou WangJunqi ChenXiao-Lei ZhangSusanto RahardjaPublished in: IEEE ACM Trans. Audio Speech Lang. Process. (2023)
Keyphrases
- multi modal
- end to end
- speech recognition
- speech corpus
- speech synthesis
- automatic speech recognition
- speech signal
- language model
- hidden markov models
- pattern recognition
- broadcast news
- high dimensional
- audio visual
- speaker identification
- text to speech
- speech recognition systems
- information retrieval
- video search
- speaker independent