Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis.
Xilin JiangYinghao Aaron LiAdrian Nicolas FloreaCong HanNima MesgaraniPublished in: CoRR (2024)
Keyphrases
- recognition engine
- speech recognition
- speech signal
- automatic speech recognition systems
- noisy environments
- speech corpus
- phoneme recognition
- text recognition
- pattern recognition
- speaker dependent
- digit recognition
- audio visual
- automatic speech recognition
- speech synthesis
- language acquisition
- continuous speech recognition
- endpoint detection
- recognition accuracy
- speech sounds
- automatic recognition
- spoken language
- action recognition
- text to speech
- recognition rate
- speech recognition systems
- broadcast news
- feature extraction
- speaker identification
- emotion recognition
- activity recognition
- spoken words
- automatic transcription
- video sequences
- object recognition
- multi modal
- audio stream
- human computer interaction
- english text
- speaker independent
- handwriting recognition
- computer vision
- vocal tract
- speaker recognition