Human and Transformer-Based Prosodic Phrasing in Two Speech Genres.
Jan Volín, Markéta Rezácková, Jindrich Matousek
Published in: SPECOM (2021)
Keyphrases
- speech recognition
- text to speech synthesis
- language acquisition
- speech synthesis
- text to speech
- prosodic features
- human subjects
- artificial intelligence
- human communication
- audio visual
- automatic speech recognition
- human interaction
- genre classification
- speech signal
- data sets
- human centered
- fault diagnosis
- multi modal
- text classification
- vocal tract
- fuzzy logic
- synthesized speech
- neural network