Picture for Philip Torr

Philip Torr

Running the Gauntlet: Re-evaluating the Capabilities of Agents Beyond Familiar Environments

Add code
Jun 12, 2026
Viaarxiv icon

When Language Representations Interact: Separability and Cross-Lingual Effects in LLMs

Add code
Jun 12, 2026
Viaarxiv icon

A Low-Rank Subspace Analysis of LLM Interventions

Add code
Jun 12, 2026
Viaarxiv icon

Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

Add code
Jun 09, 2026
Viaarxiv icon

Pretraining Language Models on Historical Text

Add code
Jun 02, 2026
Viaarxiv icon

Plan2Map: A Multimodal Benchmark for Document-Grounded Geospatial Boundary Reconstruction from Planning Records

Add code
Jun 01, 2026
Viaarxiv icon

SeClaw: Spec-Driven Security Task Synthesis for Evaluating Autonomous Agents

Add code
Jun 01, 2026
Viaarxiv icon

ELAN4D: Embodiment-Centric 4D Supervision for Vision-Language-Action Models via Plug-and-Play Adaptation

Add code
May 28, 2026
Viaarxiv icon

$D^2$-Monitor: Dynamic Safety Monitoring for Diffusion LLMs via Hesitation-Aware Routing

Add code
May 25, 2026
Viaarxiv icon

The Path Matters: Learning a Token-Commitment Policy for Diffusion Language Models

Add code
May 23, 2026
Viaarxiv icon