At the Border of Acoustics and Linguistics: Bag-of-Audio-Words for the Recognition of Emotions in Speech.
Maximilian Schmitt, Fabien Ringeval, Björn W. Schuller
Published in: INTERSPEECH (2016)
Keyphrases
- automatic transcription
- emotion recognition
- speech corpus
- continuous speech recognition
- spontaneous speech
- spoken words
- speech music discrimination
- text recognition
- speech recognition systems
- automatic speech recognition
- recognition engine
- audio stream
- audio visual
- speech recognition
- broadcast news
- word recognition
- speech sounds
- emotional state
- recognition rate
- visual features
- character recognition
- visual speech
- human language
- speech synthesis
- speaker identification
- facial expressions
- language learning
- prosodic features
- pattern recognition
- emotion classification
- digital audio
- human computer interaction
- multimedia
- handwriting recognition
- cognitive science
- bag of words
- spoken language
- word segmentation
- acoustic signals
- text to speech
- emotional speech
- hidden markov models
- document analysis
- object recognition
- natural language
- human machine interaction