Picture for Alan Yuille

Alan Yuille

Johns Hopkins University

Play to Generalize: Learning to Reason Through Game Play

Add code
Jun 09, 2025
Viaarxiv icon

PartInstruct: Part-level Instruction Following for Fine-grained Robot Manipulation

Add code
May 27, 2025
Viaarxiv icon

Are Vision Language Models Ready for Clinical Diagnosis? A 3D Medical Benchmark for Tumor-centric Visual Question Answering

Add code
May 25, 2025
Viaarxiv icon

Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers

Add code
May 20, 2025
Viaarxiv icon

SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models

Add code
May 01, 2025
Viaarxiv icon

ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction

Add code
Apr 30, 2025
Viaarxiv icon

SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning

Add code
Apr 28, 2025
Viaarxiv icon

KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation

Add code
Apr 13, 2025
Viaarxiv icon

DINeMo: Learning Neural Mesh Models with no 3D Annotations

Add code
Mar 26, 2025
Viaarxiv icon

X-LRM: X-ray Large Reconstruction Model for Extremely Sparse-View Computed Tomography Recovery in One Second

Add code
Mar 09, 2025
Viaarxiv icon