An Open Dataset of Synthetic Speech.
Artem YaroshchukChristoforos PapastergiopoulosLuca CuccovilloPatrick AichrothKonstantinos VotisDimitrios TzovarasPublished in: WIFS (2023)
Keyphrases
- speech recognition
- speech synthesis
- benchmark datasets
- text to speech
- speech signal
- data sets
- real time
- text to speech synthesis
- real images are presented
- recognition engine
- speaker recognition
- spoken language
- automatic speech recognition
- audio visual
- human actions
- multi modal
- real life
- case study
- information systems
- real world