
Philip Torr

Unforgotten Safety: Preserving Safety Alignment of Large Language Models with Continual Learning

Dec 10, 2025
Via arXiv

Computer-Use Agents as Judges for Generative User Interface

Nov 19, 2025
Via arXiv

DEEDEE: Fast and Scalable Out-of-Distribution Dynamics Detection

Oct 24, 2025
Via arXiv

A Guardrail for Safety Preservation: When Safety-Sensitive Subspace Meets Harmful-Resistant Null-Space

Oct 16, 2025
Via arXiv

MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning

Oct 09, 2025
Via arXiv

h1: Bootstrapping LLMs to Reason over Longer Horizons via Reinforcement Learning

Oct 08, 2025
Via arXiv

Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training

Sep 30, 2025
Via arXiv

LLM Jailbreak Detection for (Almost) Free!

Sep 18, 2025
Via arXiv

Interleaving Reasoning for Better Text-to-Image Generation

Sep 09, 2025
Via arXiv

Articulate3D: Zero-Shot Text-Driven 3D Object Posing

Aug 26, 2025
Via arXiv