Picture for Zhiwen Fan

Zhiwen Fan

NavTrust: Benchmarking Trustworthiness for Embodied Navigation

Add code
Mar 19, 2026
Viaarxiv icon

NanoGS: Training-Free Gaussian Splat Simplification

Add code
Mar 17, 2026
Viaarxiv icon

Egocentric World Model for Photorealistic Hand-Object Interaction Synthesis

Add code
Mar 13, 2026
Viaarxiv icon

Learning Actionable Manipulation Recovery via Counterfactual Failure Synthesis

Add code
Mar 13, 2026
Viaarxiv icon

RuleSmith: Multi-Agent LLMs for Automated Game Balancing

Add code
Feb 05, 2026
Viaarxiv icon

MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding

Add code
Jul 16, 2025
Figure 1 for MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding
Figure 2 for MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding
Figure 3 for MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding
Figure 4 for MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding
Viaarxiv icon

Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions

Add code
Jul 10, 2025
Figure 1 for Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions
Figure 2 for Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions
Figure 3 for Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions
Figure 4 for Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions
Viaarxiv icon

CryoFastAR: Fast Cryo-EM Ab Initio Reconstruction Made Easy

Add code
Jun 06, 2025
Viaarxiv icon

VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction

Add code
May 26, 2025
Viaarxiv icon

Generative AI for Autonomous Driving: Frontiers and Opportunities

Add code
May 13, 2025
Viaarxiv icon