Shaofeng Liang

MemoGen: Can Past Experience Improve Future Text-to-Image Generation?

Jun 02, 2026
Via arXiv

Learning to Think in Physics: Breaking Shortcut Learning in Scientific Diffusion via Representation Alignment

May 20, 2026
Via arXiv

Before the Body Moves: Learning Anticipatory Joint Intent for Language-Conditioned Humanoid Control

May 14, 2026
Via arXiv

$Z^2$-Sampling: Zero-Cost Zigzag Trajectories for Semantic Alignment in Diffusion Models

Apr 26, 2026
Via arXiv

Oracle Noise: Faster Semantic Spherical Alignment for Interpretable Latent Optimization

Apr 26, 2026
Via arXiv

WaterVideoQA: ASV-Centric Perception and Rule-Compliant Reasoning via Multi-Modal Agents

Feb 26, 2026
Via arXiv

Guided Path Sampling: Steering Diffusion Models Back on Track with Principled Path Guidance

Dec 28, 2025
Via arXiv

Wavelet-based Multi-View Fusion of 4D Radar Tensor and Camera for Robust 3D Object Detection

Dec 28, 2025
Via arXiv

MMDrive: Interactive Scene Understanding Beyond Vision with Multi-representational Fusion

Dec 16, 2025
Via arXiv

Da Yu: Towards USV-Based Image Captioning for Waterway Surveillance and Scene Understanding

Jun 24, 2025
Via arXiv