Audio Spectrogram Transformer for Synthetic Speech Detection via Speech Formant Analysis.
Luca CuccovilloMilica GerhardtPatrick AichrothPublished in: WIFS (2023)
Keyphrases
- speech signal
- speech recognition
- automatic speech recognition
- speaker identification
- audio visual
- spectral analysis
- audio stream
- linear prediction
- pattern analysis
- text to speech
- emotion recognition
- audio signals
- vocal tract
- spontaneous speech
- acoustic features
- false positives
- object detection
- broadcast news
- voice activity detection
- cepstral features
- speech processing
- noisy environments
- non stationary
- pattern recognition
- speaker recognition
- visual information
- fault diagnosis
- fuzzy logic
- prosodic features
- data analysis
- automatic transcription
- linear predictive coding
- feature extraction