N-Shot Benchmarking of Whisper on Diverse Arabic Speech Recognition.
Bashar TalafhaAbdul WaheedMuhammad Abdul-MageedPublished in: INTERSPEECH (2023)
Keyphrases
- speech recognition
- isolated word
- handwriting recognition
- hidden markov models
- pattern recognition
- automatic speech recognition
- speech synthesis
- speech processing
- language model
- video sequences
- speech signal
- speech understanding
- speech recognition technology
- speaker identification
- speech recognition systems
- speech recognizer
- keyword spotting
- noisy environments
- speaker independent
- language identification
- visual features
- video shots
- video data
- speech recognizers
- news video
- speaker diarization
- neural network
- image processing
- computer vision