TIMIT-TTS: A Text-to-Speech Dataset for Multimodal Synthetic Media Detection.
Davide Salvi, Brian C. Hosler, Paolo Bestagini, Matthew C. Stamm, Stefano Tubaro
Published in: IEEE Access (2023)
Keyphrases
- text to speech
- multimodal interaction
- speech synthesis
- prosodic features
- speech corpus
- programming tool
- multimedia
- English text
- word processing
- text to speech synthesis
- object detection
- speaker verification
- anomaly detection
- automatic detection
- hidden Markov models
- detection algorithm
- detection method
- multi modal
- detection accuracy
- real world
- detection rate
- feature selection