Guangrun Wang

RADAR: Benchmarking Vision-Language-Action Generalization via Real-World Dynamics, Spatial-Physical Intelligence, and Autonomous Evaluation

Feb 11, 2026
Via arXiv

Bridging the Discrete-Continuous Gap: Unified Multimodal Generation via Coupled Manifold Discrete Absorbing Diffusion

Jan 07, 2026
Via arXiv

Stable Language Guidance for Vision-Language-Action Models

Jan 07, 2026
Via arXiv

ACD: Direct Conditional Control for Video Diffusion Models via Attention Supervision

Dec 24, 2025
Via arXiv

UML-CoT: Structured Reasoning and Planning with Unified Modeling Language for Robotic Room Cleaning

Sep 26, 2025
Via arXiv

GS: Generative Segmentation via Label Diffusion

Aug 27, 2025
Via arXiv

ReaLM: Reflection-Enhanced Autonomous Reasoning with Small Language Models

Aug 17, 2025
Via arXiv

One Model For All: Partial Diffusion for Unified Try-On and Try-Off in Any Pose

Aug 06, 2025
Via arXiv

SIRI-Bench: Challenging VLMs' Spatial Intelligence through Complex Reasoning Tasks

Jun 17, 2025
Via arXiv

Implicit Neural Representations for Chemical Reaction Paths

Feb 20, 2025
Via arXiv