Haowen Sun

R2RDreamer: 3D-aware Data Augmentation for Spatially-generalized 2D Manipulation Policies

Jun 15, 2026

OASIS: Observation-Action Space Alignment via SE(3) Trajectory Prediction for Robotic Manipulation

May 25, 2026

AffordSim: A Scalable Data Generator and Benchmark for Affordance-Aware Robotic Manipulation

Apr 13, 2026

RoboTAG: End-to-end Robot Configuration Estimation via Topological Alignment Graph

Nov 11, 2025

Ambiguity-aware Point Cloud Segmentation by Adaptive Margin Contrastive Learning

Jul 09, 2025

PointVDP: Learning View-Dependent Projection by Fireworks Rays for 3D Point Cloud Segmentation

Jul 09, 2025

VReST: Enhancing Reasoning in Large Vision-Language Models through Tree Search and Self-Reward Mechanism

Jun 10, 2025

PRISM: Projection-based Reward Integration for Scene-Aware Real-to-Sim-to-Real Transfer with Few Demonstrations

Apr 29, 2025

Exploring Forgetting in Large Language Model Pre-Training

Oct 22, 2024

ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model

Aug 29, 2024