Login / Signup
Cross-Modality and Within-Modality Regularization for Audio-Visual DeepFake Detection.
Heqing Zou
Meng Shen
Yuchen Hu
Chen Chen
Eng Siong Chng
Deepu Rajan
Published in:
CoRR (2024)
Keyphrases
</>
audio visual
multi modal
visual information
audio visual speech recognition
multi stream
video summarization
temporal context
person authentication
emotion recognition
visual data
multimedia
data sets
image representation
moving objects
high level
vehicle detection
computer vision
multimodal fusion