Time-Frequency Localization Using Deep Convolutional Maxout Neural Network in Persian Speech Recognition.
Arash Dehghani, Seyyed Ali Seyyedsalehi
Published in: CoRR (2021)
Keyphrases
- speech recognition
- neural network
- pattern recognition
- deep learning
- hidden Markov models
- speech recognizer
- automatic speech recognition
- speech signal
- language model
- speech processing
- signal processing
- speech synthesis
- speech understanding
- speech recognizers
- speech recognition systems
- noisy environments
- speech recognition technology
- speaker identification
- frequency domain
- speaker independent
- isolated word
- text retrieval
- text classification
- speaker diarization
- keyword spotting
- multilayer perceptron
- speaker dependent
- speech retrieval
- wavelet transform
- feature extraction