Picture for Philip Torr

Philip Torr

Model-agnostic Origin Attribution of Generated Images with Few-shot Examples

Add code
Apr 03, 2024
Viaarxiv icon

DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion

Add code
Mar 25, 2024
Figure 1 for DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion
Figure 2 for DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion
Figure 3 for DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion
Figure 4 for DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion
Viaarxiv icon

RoDLA: Benchmarking the Robustness of Document Layout Analysis Models

Add code
Mar 21, 2024
Figure 1 for RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
Figure 2 for RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
Figure 3 for RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
Figure 4 for RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
Viaarxiv icon

On Pretraining Data Diversity for Self-Supervised Learning

Add code
Mar 20, 2024
Figure 1 for On Pretraining Data Diversity for Self-Supervised Learning
Figure 2 for On Pretraining Data Diversity for Self-Supervised Learning
Figure 3 for On Pretraining Data Diversity for Self-Supervised Learning
Figure 4 for On Pretraining Data Diversity for Self-Supervised Learning
Viaarxiv icon

A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization

Add code
Mar 20, 2024
Figure 1 for A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Figure 2 for A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Figure 3 for A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Figure 4 for A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Viaarxiv icon

As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks?

Add code
Mar 19, 2024
Figure 1 for As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks?
Figure 2 for As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks?
Figure 3 for As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks?
Figure 4 for As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks?
Viaarxiv icon

DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM

Add code
Mar 19, 2024
Figure 1 for DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
Figure 2 for DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
Figure 3 for DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
Figure 4 for DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
Viaarxiv icon

VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models

Add code
Mar 18, 2024
Figure 1 for VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
Figure 2 for VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
Figure 3 for VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
Figure 4 for VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
Viaarxiv icon

GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

Add code
Mar 14, 2024
Figure 1 for GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
Figure 2 for GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
Figure 3 for GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
Figure 4 for GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing
Viaarxiv icon

An Image Is Worth 1000 Lies: Adversarial Transferability across Prompts on Vision-Language Models

Add code
Mar 14, 2024
Figure 1 for An Image Is Worth 1000 Lies: Adversarial Transferability across Prompts on Vision-Language Models
Figure 2 for An Image Is Worth 1000 Lies: Adversarial Transferability across Prompts on Vision-Language Models
Figure 3 for An Image Is Worth 1000 Lies: Adversarial Transferability across Prompts on Vision-Language Models
Figure 4 for An Image Is Worth 1000 Lies: Adversarial Transferability across Prompts on Vision-Language Models
Viaarxiv icon