Picture for Philip Torr

Philip Torr

NeurIPS 2025 E2LM Competition : Early Training Evaluation of Language Models

Add code
Jun 09, 2025
Viaarxiv icon

Towards Reliable Identification of Diffusion-based Image Manipulations

Add code
Jun 05, 2025
Viaarxiv icon

Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks

Add code
May 30, 2025
Viaarxiv icon

Revisiting Uncertainty Estimation and Calibration of Large Language Models

Add code
May 29, 2025
Viaarxiv icon

MedPAIR: Measuring Physicians and AI Relevance Alignment in Medical Question Answering

Add code
May 29, 2025
Viaarxiv icon

Large Language Models Miss the Multi-Agent Mark

Add code
May 27, 2025
Viaarxiv icon

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Add code
May 27, 2025
Viaarxiv icon

Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention

Add code
May 26, 2025
Viaarxiv icon

CHAOS: Chart Analysis with Outlier Samples

Add code
May 22, 2025
Viaarxiv icon

AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research

Add code
May 17, 2025
Viaarxiv icon