Alert button
Picture for Zhixi Cai

Zhixi Cai

Alert button

JRDB-Social: A Multifaceted Robotic Dataset for Understanding of Context and Dynamics of Human Interactions Within Social Groups

Add code
Bookmark button
Alert button
Apr 06, 2024
Simindokht Jahangard, Zhixi Cai, Shiki Wen, Hamid Rezatofighi

Viaarxiv icon

HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning

Add code
Bookmark button
Alert button
Mar 19, 2024
Fucai Ke, Zhixi Cai, Simindokht Jahangard, Weiqing Wang, Pari Delir Haghighi, Hamid Rezatofighi

Figure 1 for HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
Figure 2 for HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
Figure 3 for HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
Figure 4 for HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
Viaarxiv icon

AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset

Add code
Bookmark button
Alert button
Nov 26, 2023
Zhixi Cai, Shreya Ghosh, Aman Pankaj Adatia, Munawar Hayat, Abhinav Dhall, Kalin Stefanov

Viaarxiv icon

Pavlok-Nudge: A Feedback Mechanism for Atomic Behaviour Modification with Snoring Usecase

Add code
Bookmark button
Alert button
May 11, 2023
Shreya Ghosh, Rakibul Hasan, Pradyumna Agrawal, Zhixi Cai, Susannah Soon, Abhinav Dhall, Tom Gedeon

Figure 1 for Pavlok-Nudge: A Feedback Mechanism for Atomic Behaviour Modification with Snoring Usecase
Figure 2 for Pavlok-Nudge: A Feedback Mechanism for Atomic Behaviour Modification with Snoring Usecase
Viaarxiv icon

"Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization

Add code
Bookmark button
Alert button
May 05, 2023
Zhixi Cai, Shreya Ghosh, Abhinav Dhall, Tom Gedeon, Kalin Stefanov, Munawar Hayat

Figure 1 for "Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization
Figure 2 for "Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization
Figure 3 for "Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization
Figure 4 for "Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization
Viaarxiv icon

MARLIN: Masked Autoencoder for facial video Representation LearnINg

Add code
Bookmark button
Alert button
Nov 12, 2022
Zhixi Cai, Shreya Ghosh, Kalin Stefanov, Abhinav Dhall, Jianfei Cai, Hamid Rezatofighi, Reza Haffari, Munawar Hayat

Figure 1 for MARLIN: Masked Autoencoder for facial video Representation LearnINg
Figure 2 for MARLIN: Masked Autoencoder for facial video Representation LearnINg
Figure 3 for MARLIN: Masked Autoencoder for facial video Representation LearnINg
Figure 4 for MARLIN: Masked Autoencoder for facial video Representation LearnINg
Viaarxiv icon

Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization

Add code
Bookmark button
Alert button
Apr 13, 2022
Zhixi Cai, Kalin Stefanov, Abhinav Dhall, Munawar Hayat

Figure 1 for Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization
Figure 2 for Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization
Figure 3 for Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization
Figure 4 for Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization
Viaarxiv icon