Picture for Yan Yan

Yan Yan

RayMap3R: Inference-Time RayMap for Dynamic 3D Reconstruction

Add code
Mar 21, 2026
Viaarxiv icon

VLA Knows Its Limits

Add code
Feb 24, 2026
Viaarxiv icon

Bi-Level Prompt Optimization for Multimodal LLM-as-a-Judge

Add code
Feb 11, 2026
Viaarxiv icon

Real-Time Robot Execution with Masked Action Chunking

Add code
Jan 27, 2026
Viaarxiv icon

CogniMap3D: Cognitive 3D Mapping and Rapid Retrieval

Add code
Jan 13, 2026
Viaarxiv icon

Consistent Instance Field for Dynamic Scene Understanding

Add code
Dec 16, 2025
Figure 1 for Consistent Instance Field for Dynamic Scene Understanding
Figure 2 for Consistent Instance Field for Dynamic Scene Understanding
Figure 3 for Consistent Instance Field for Dynamic Scene Understanding
Figure 4 for Consistent Instance Field for Dynamic Scene Understanding
Viaarxiv icon

Distill Video Datasets into Images

Add code
Dec 16, 2025
Figure 1 for Distill Video Datasets into Images
Figure 2 for Distill Video Datasets into Images
Figure 3 for Distill Video Datasets into Images
Figure 4 for Distill Video Datasets into Images
Viaarxiv icon

From Particles to Fields: Reframing Photon Mapping with Continuous Gaussian Photon Fields

Add code
Dec 13, 2025
Viaarxiv icon

VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction

Add code
Dec 11, 2025
Figure 1 for VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction
Figure 2 for VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction
Figure 3 for VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction
Figure 4 for VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction
Viaarxiv icon

TraceFlow: Dynamic 3D Reconstruction of Specular Scenes Driven by Ray Tracing

Add code
Dec 10, 2025
Viaarxiv icon