Wufei Ma

Johns Hopkins University

LychSim: A Controllable and Interactive Simulation Framework for Vision Research

May 12, 2026
Via arXiv

PASR: Pose-Aware 3D Shape Retrieval from Occluded Single Views

Apr 24, 2026
Via arXiv

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

Apr 06, 2026
Via arXiv

Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning

Dec 18, 2025
Via arXiv

SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models

May 01, 2025
Via arXiv

SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning

Apr 28, 2025
Via arXiv

DINeMo: Learning Neural Mesh Models with no 3D Annotations

Mar 26, 2025
Via arXiv

PulseCheck457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models

Feb 13, 2025
Via arXiv

PulseCheck457: A Diagnostic Benchmark for Comprehensive Spatial Reasoning of Large Multimodal Models

Feb 12, 2025
Via arXiv

3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark

Dec 10, 2024
Via arXiv