Philip Torr

DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM

Mar 19, 2024
Yixuan Wu, Yizhou Wang, Shixiang Tang, Wenhao Wu, Tong He, Wanli Ouyang, Jian Wu, Philip Torr

Via arXiv

VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models

Mar 18, 2024
Junlin Han, Filippos Kokkinos, Philip Torr

Via arXiv

An Image Is Worth 1000 Lies: Adversarial Transferability across Prompts on Vision-Language Models

Mar 14, 2024
Haochen Luo, Jindong Gu, Fengyuan Liu, Philip Torr

Via arXiv

GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

Mar 14, 2024
Jing Wu, Jia-Wang Bian, Xinghui Li, Guangrun Wang, Ian Reid, Philip Torr, Victor Adrian Prisacariu

Via arXiv

CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios

Mar 07, 2024
Qilang Ye, Zitong Yu, Rui Shao, Xinyu Xie, Philip Torr, Xiaochun Cao

Via arXiv

Lifelong Benchmarks: Efficient Model Evaluation in an Era of Rapid Progress

Feb 29, 2024
Ameya Prabhu, Vishaal Udandarao, Philip Torr, Matthias Bethge, Adel Bibi, Samuel Albanie

Via arXiv

Stop Reasoning! When Multimodal LLMs with Chain-of-Thought Reasoning Meets Adversarial Images

Feb 22, 2024
Zefeng Wang, Zhen Han, Shuo Chen, Fan Xue, Zifeng Ding, Xun Xiao, Volker Tresp, Philip Torr, Jindong Gu

Via arXiv

Corrective Machine Unlearning

Feb 21, 2024
Shashwat Goel, Ameya Prabhu, Philip Torr, Ponnurangam Kumaraguru, Amartya Sanyal

Via arXiv

Can Large Language Model Agents Simulate Human Trust Behaviors?

Feb 07, 2024
Chengxing Xie, Canyu Chen, Feiran Jia, Ziyu Ye, Kai Shu, Adel Bibi, Ziniu Hu, Philip Torr, Bernard Ghanem, Guohao Li

Via arXiv

SynthCLIP: Are We Ready for a Fully Synthetic CLIP Training?

Feb 02, 2024
Hasan Abed Al Kader Hammoud, Hani Itani, Fabio Pizzati, Philip Torr, Adel Bibi, Bernard Ghanem

Via arXiv