Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching.
Shiqing ZhangShiliang ZhangTiejun HuangWen GaoPublished in: IEEE Trans. Multim. (2018)
Keyphrases
- convolutional neural network
- emotion recognition
- face detection
- text to speech synthesis
- emotional state
- spatio temporal
- multiscale
- emotional speech
- matching algorithm
- image matching
- temporal information
- discriminant analysis
- neural network
- facial expressions
- speech signal
- matching process
- multiresolution
- temporal data
- temporal constraints
- graph matching
- spatial and temporal
- speech recognition
- feature points
- audio visual
- automatic speech recognition
- low dimensional
- noisy environments
- image classification
- spoken language
- speech synthesis
- spatial pyramid matching
- keypoints
- space time