Synthetic Speech Detection through Audio Folding.
Davide SalviPaolo BestaginiStefano TubaroPublished in: MAD@ICMR (2023)
Keyphrases
- audio visual
- audio stream
- voice activity detection
- automatic detection
- speaker identification
- emotion recognition
- multimedia
- audio signals
- detection method
- object detection
- digital audio
- speech processing
- detection accuracy
- soccer video
- false alarms
- signal processing
- noisy environments
- event detection
- detection rate
- broadcast news
- multi modal
- visual data
- audio recordings
- prosodic features
- acoustic signals
- linear predictive coding
- real world
- speech music discrimination