Sign in
Zhixi Cai
ORCID
Publication Activity (10 Years)
Years Active: 2022-2023
Publications (10 Years): 8
Top Topics
Multi Stream
Visual Information
Visualization Tool
Forgery Detection
Top Venues
CoRR
Comput. Vis. Image Underst.
CVPR
</>
Publications
</>
Shreya Ghosh
,
Zhixi Cai
,
Parul Gupta
,
Garima Sharma
,
Abhinav Dhall
,
Munawar Hayat
,
Tom Gedeon
Emolysis: A Multimodal Open-Source Group Emotion Analysis and Visualization Toolkit.
CoRR
(2023)
Zhixi Cai
,
Shreya Ghosh
,
Kalin Stefanov
,
Abhinav Dhall
,
Jianfei Cai
,
Hamid Rezatofighi
,
Reza Haffari
,
Munawar Hayat
MARLIN: Masked Autoencoder for facial video Representation LearnINg.
CVPR
(2023)
Shreya Ghosh
,
Rakibul Hasan
,
Pradyumna Agrawal
,
Zhixi Cai
,
Susannah Soon
,
Abhinav Dhall
,
Tom Gedeon
Pavlok-Nudge: A Feedback Mechanism for Atomic Behaviour Modification with Snoring Usecase.
CoRR
(2023)
Zhixi Cai
,
Shreya Ghosh
,
Abhinav Dhall
,
Tom Gedeon
,
Kalin Stefanov
,
Munawar Hayat
"Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization.
CoRR
(2023)
Zhixi Cai
,
Shreya Ghosh
,
Abhinav Dhall
,
Tom Gedeon
,
Kalin Stefanov
,
Munawar Hayat
: A large scale benchmark for content driven audio-visual forgery detection and localization.
Comput. Vis. Image Underst.
236 (2023)
Zhixi Cai
,
Shreya Ghosh
,
Aman Pankaj Adatia
,
Munawar Hayat
,
Abhinav Dhall
,
Kalin Stefanov
AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset.
CoRR
(2023)
Zhixi Cai
,
Shreya Ghosh
,
Kalin Stefanov
,
Abhinav Dhall
,
Jianfei Cai
,
Hamid Rezatofighi
,
Reza Haffari
,
Munawar Hayat
MARLIN: Masked Autoencoder for facial video Representation LearnINg.
CoRR
(2022)
Zhixi Cai
,
Kalin Stefanov
,
Abhinav Dhall
,
Munawar Hayat
Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization.
CoRR
(2022)