Multichannel Audio Front-End for Far-Field Automatic Speech Recognition.
Amit S. ChhetriPhilip HilmesTrausti KristjanssonWai ChuMohamed MansourXiaoxue LiXianxian ZhangPublished in: EUSIPCO (2018)
Keyphrases
- automatic speech recognition
- broadcast news
- acoustic features
- speech recognition
- speech signal
- speaker identification
- linear prediction
- hidden markov models
- conversational speech
- word error rate
- spontaneous speech
- noisy environments
- speech retrieval
- multimedia
- spoken words
- word recognition
- speech corpus
- visual information
- neural network
- audio features
- audio visual
- music information retrieval
- spoken document retrieval
- single channel
- visual data
- computer vision
- speaker diarization
- recognition errors
- multiresolution