Picture for Philip Torr

Philip Torr

Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing

Add code
Jul 29, 2024
Figure 1 for Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing
Figure 2 for Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing
Figure 3 for Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing
Figure 4 for Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing
Viaarxiv icon

Can Editing LLMs Inject Harm?

Add code
Jul 29, 2024
Viaarxiv icon

FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging

Add code
Jul 11, 2024
Figure 1 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Figure 2 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Figure 3 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Figure 4 for FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging
Viaarxiv icon

Extracting Training Data from Document-Based VQA Models

Add code
Jul 11, 2024
Viaarxiv icon

Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge

Add code
Jul 05, 2024
Viaarxiv icon

CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents

Add code
Jul 01, 2024
Figure 1 for CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents
Figure 2 for CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents
Figure 3 for CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents
Figure 4 for CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents
Viaarxiv icon

PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

Add code
Jun 24, 2024
Figure 1 for PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Figure 2 for PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Figure 3 for PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Figure 4 for PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Viaarxiv icon

Model Merging and Safety Alignment: One Bad Model Spoils the Bunch

Add code
Jun 20, 2024
Viaarxiv icon

Localizing Events in Videos with Multimodal Queries

Add code
Jun 14, 2024
Figure 1 for Localizing Events in Videos with Multimodal Queries
Figure 2 for Localizing Events in Videos with Multimodal Queries
Figure 3 for Localizing Events in Videos with Multimodal Queries
Figure 4 for Localizing Events in Videos with Multimodal Queries
Viaarxiv icon

Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation

Add code
Jun 07, 2024
Figure 1 for Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation
Figure 2 for Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation
Figure 3 for Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation
Figure 4 for Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation
Viaarxiv icon