Point to the Hidden: Exposing Speech Audio Splicing via Signal Pointer Nets.
Denise MoussaGermans HirschSebastian WankerlChristian RiessPublished in: CoRR (2023)
Keyphrases
- signal processing
- audio visual
- audio stream
- broadcast news
- audio signals
- speaker identification
- speech processing
- data structure
- acoustic signals
- audio signal
- acoustic signal
- noisy environments
- text to speech
- audio video
- audio recordings
- emotion recognition
- automatic speech recognition
- digital audio
- speech recognition
- multimedia
- speech music discrimination
- fundamental frequency
- frequency domain
- visual information
- cepstral features
- voice activity detection
- visual speech
- audio features
- automatic transcription
- non stationary
- signal dependent
- language acquisition
- linear predictive coding
- speech segments
- pattern recognition
- speech corpus
- multi modal
- music information retrieval
- dna sequences
- human language
- speaker recognition
- multi stream