On-the-fly ASR Corrections with Audio Exemplars.
Golan PundakTsendsuren MunkhdalaiKhe Chai SimPublished in: INTERSPEECH (2022)
Keyphrases
- automatic speech recognition
- broadcast news
- multimedia
- speech recognition
- spontaneous speech
- signal processing
- acoustic features
- audio visual
- visual information
- real time
- cepstral features
- audio stream
- music genre classification
- audio recordings
- audio video
- audio signals
- multimedia information
- case study
- cross modal
- noisy environments
- image retrieval
- pattern recognition
- training data
- information systems
- audio content
- neural network
- music scores
- database