Textless Speech Emotion Conversion using Discrete & Decomposed Representations.
Felix Kreuk, Adam Polyak, Jade Copet, Eugene Kharitonov, Tu Anh Nguyen, Morgane Rivière, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi
Published in: EMNLP (2022)
Keyphrases
- emotion recognition
- text to speech synthesis
- emotional state
- speech recognition
- emotional speech
- facial expressions
- finite number
- automatic speech recognition
- text to speech
- audio visual
- dialogue system
- multiple representations
- speech synthesis
- data sets
- information retrieval
- endpoint detection
- recognition engine
- multi stream
- discrete geometry
- spoken language
- neural network
- speech signal