Philip Torr

VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

Jun 23, 2025
Via arXiv

Pushing the Limits of Safety: A Technical Report on the ATLAS Challenge 2025

Jun 14, 2025
Via arXiv

How Visual Representations Map to Language Feature Space in Multimodal LLMs

Jun 13, 2025
Via arXiv

VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning

Jun 10, 2025
Via arXiv

NeurIPS 2025 E2LM Competition: Early Training Evaluation of Language Models

Jun 09, 2025
Via arXiv

Towards Reliable Identification of Diffusion-based Image Manipulations

Jun 05, 2025
Via arXiv

Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks

May 30, 2025
Via arXiv

Revisiting Uncertainty Estimation and Calibration of Large Language Models

May 29, 2025
Via arXiv

MedPAIR: Measuring Physicians and AI Relevance Alignment in Medical Question Answering

May 29, 2025
Via arXiv

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

May 27, 2025
Via arXiv