Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition.
Lei Liu, Li Liu, Haizhou Li
Published in: CoRR (2024)
Keyphrases
- speech recognition
- language model
- hidden Markov models
- pattern recognition
- speech processing
- automatic speech recognition
- speech signal
- noisy environments
- speech synthesis
- speaker identification
- speech understanding
- speech recognition technology
- speech recognizer
- keyword spotting
- speaker recognition
- speech recognizers
- multi-modal fusion
- speech recognition errors
- detection method
- audio-visual speech recognition