Philip Torr

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Dec 18, 2025
Via arXiv

Memory in the Age of AI Agents

Dec 15, 2025
Via arXiv

Unforgotten Safety: Preserving Safety Alignment of Large Language Models with Continual Learning

Dec 10, 2025
Via arXiv

Computer-Use Agents as Judges for Generative User Interface

Nov 19, 2025
Via arXiv

DEEDEE: Fast and Scalable Out-of-Distribution Dynamics Detection

Oct 24, 2025
Via arXiv

A Guardrail for Safety Preservation: When Safety-Sensitive Subspace Meets Harmful-Resistant Null-Space

Oct 16, 2025
Via arXiv

MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning

Oct 09, 2025
Via arXiv

h1: Bootstrapping LLMs to Reason over Longer Horizons via Reinforcement Learning

Oct 08, 2025
Via arXiv

Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training

Sep 30, 2025
Via arXiv

LLM Jailbreak Detection for (Almost) Free!

Sep 18, 2025
Via arXiv