Picture for Philip Torr

Philip Torr

As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks?

Add code
Mar 19, 2024
Figure 1 for As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks?
Figure 2 for As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks?
Figure 3 for As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks?
Figure 4 for As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks?
Viaarxiv icon

DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM

Add code
Mar 19, 2024
Viaarxiv icon

VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models

Add code
Mar 18, 2024
Viaarxiv icon

An Image Is Worth 1000 Lies: Adversarial Transferability across Prompts on Vision-Language Models

Add code
Mar 14, 2024
Viaarxiv icon

GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

Add code
Mar 14, 2024
Figure 1 for GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
Figure 2 for GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
Figure 3 for GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
Figure 4 for GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
Viaarxiv icon

CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios

Add code
Mar 07, 2024
Figure 1 for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios
Figure 2 for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios
Figure 3 for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios
Figure 4 for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios
Viaarxiv icon

Lifelong Benchmarks: Efficient Model Evaluation in an Era of Rapid Progress

Add code
Feb 29, 2024
Viaarxiv icon

Stop Reasoning! When Multimodal LLMs with Chain-of-Thought Reasoning Meets Adversarial Images

Add code
Feb 22, 2024
Figure 1 for Stop Reasoning! When Multimodal LLMs with Chain-of-Thought Reasoning Meets Adversarial Images
Figure 2 for Stop Reasoning! When Multimodal LLMs with Chain-of-Thought Reasoning Meets Adversarial Images
Figure 3 for Stop Reasoning! When Multimodal LLMs with Chain-of-Thought Reasoning Meets Adversarial Images
Figure 4 for Stop Reasoning! When Multimodal LLMs with Chain-of-Thought Reasoning Meets Adversarial Images
Viaarxiv icon

Corrective Machine Unlearning

Add code
Feb 21, 2024
Figure 1 for Corrective Machine Unlearning
Figure 2 for Corrective Machine Unlearning
Figure 3 for Corrective Machine Unlearning
Figure 4 for Corrective Machine Unlearning
Viaarxiv icon

Can Large Language Model Agents Simulate Human Trust Behaviors?

Add code
Feb 07, 2024
Figure 1 for Can Large Language Model Agents Simulate Human Trust Behaviors?
Figure 2 for Can Large Language Model Agents Simulate Human Trust Behaviors?
Figure 3 for Can Large Language Model Agents Simulate Human Trust Behaviors?
Figure 4 for Can Large Language Model Agents Simulate Human Trust Behaviors?
Viaarxiv icon