Learning Emotional Representations from Imbalanced Speech Data for Speech Emotion Recognition and Emotional Text-to-Speech.
Shijun Wang, Jón Guðnason, Damian Borth
Published in: INTERSPEECH (2023)
Keyphrases
- text to speech
- data sets
- prior knowledge
- training data
- background knowledge
- learning algorithm
- database
- data collection
- word processing
- learning process
- original data
- supervised learning
- data sources
- high dimensional data
- learning tasks
- audio stream
- data points
- data mining techniques
- emotion recognition
- multiple representations
- data analysis
- data mining
- prosodic features
- speech emotion recognition