Philip Torr

Revisiting Uncertainty Estimation and Calibration of Large Language Models

May 29, 2025

Large Language Models Miss the Multi-Agent Mark

May 27, 2025

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

May 27, 2025

Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention

May 26, 2025

CHAOS: Chart Analysis with Outlier Samples

May 22, 2025

AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research

May 17, 2025

Hadamard product in deep learning: Introduction, Advances and Challenges

Apr 17, 2025

PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild

Apr 15, 2025

Stable Virtual Camera: Generative View Synthesis with Diffusion Models

Mar 18, 2025

Attacking Multimodal OS Agents with Malicious Image Patches

Mar 13, 2025