Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-Syncing DeepFakes.
Weifeng LiuTianyi SheJiawei LiuRun WangDongyu YaoZiyou LiangPublished in: CoRR (2024)
Keyphrases
- visual speech
- hidden markov models
- lip reading
- audio visual speech recognition
- visual information
- speaker identification
- temporal information
- noisy environments
- visual data
- acoustic features
- audio signals
- video signals
- speech signal
- cross modal
- keyword spotting
- visual features
- temporal constraints
- temporal analysis
- spatial and temporal
- space time
- spatio temporal
- multimedia
- temporal databases
- temporal reasoning
- human visual system
- text to speech
- speech recognition