Login / Signup
AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset.
Zhixi Cai
Shreya Ghosh
Aman Pankaj Adatia
Munawar Hayat
Abhinav Dhall
Kalin Stefanov
Published in:
CoRR (2023)
Keyphrases
</>
audio visual
multi modal
visual information
video summarization
visual data
audio visual speech recognition
temporal context
multimedia
multi stream
emotion recognition
database
human computer interaction
feature set
visual features
nearest neighbor
feature space
multimodal fusion