Picture for Philip Torr

Philip Torr

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Add code
May 27, 2025
Viaarxiv icon

Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention

Add code
May 26, 2025
Viaarxiv icon

CHAOS: Chart Analysis with Outlier Samples

Add code
May 22, 2025
Viaarxiv icon

AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research

Add code
May 17, 2025
Viaarxiv icon

Hadamard product in deep learning: Introduction, Advances and Challenges

Add code
Apr 17, 2025
Viaarxiv icon

PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild

Add code
Apr 15, 2025
Figure 1 for PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Figure 2 for PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Figure 3 for PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Figure 4 for PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Viaarxiv icon

Stable Virtual Camera: Generative View Synthesis with Diffusion Models

Add code
Mar 18, 2025
Viaarxiv icon

Attacking Multimodal OS Agents with Malicious Image Patches

Add code
Mar 13, 2025
Figure 1 for Attacking Multimodal OS Agents with Malicious Image Patches
Figure 2 for Attacking Multimodal OS Agents with Malicious Image Patches
Figure 3 for Attacking Multimodal OS Agents with Malicious Image Patches
Figure 4 for Attacking Multimodal OS Agents with Malicious Image Patches
Viaarxiv icon

Do Sparse Autoencoders Generalize? A Case Study of Answerability

Add code
Feb 27, 2025
Viaarxiv icon

Implicit Neural Representations for Chemical Reaction Paths

Add code
Feb 20, 2025
Viaarxiv icon