Picture for Lu Sheng

Lu Sheng

TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics

Add code
Oct 08, 2025
Viaarxiv icon

VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space

Add code
Aug 26, 2025
Figure 1 for VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space
Figure 2 for VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space
Figure 3 for VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space
Figure 4 for VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space
Viaarxiv icon

AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models

Add code
Jun 24, 2025
Viaarxiv icon

ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models

Add code
Jun 11, 2025
Viaarxiv icon

RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics

Add code
Jun 04, 2025
Viaarxiv icon

Personalize Anything for Free with Diffusion Transformer

Add code
Mar 16, 2025
Viaarxiv icon

HexPlane Representation for 3D Semantic Scene Understanding

Add code
Mar 07, 2025
Figure 1 for HexPlane Representation for 3D Semantic Scene Understanding
Figure 2 for HexPlane Representation for 3D Semantic Scene Understanding
Figure 3 for HexPlane Representation for 3D Semantic Scene Understanding
Figure 4 for HexPlane Representation for 3D Semantic Scene Understanding
Viaarxiv icon

Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

Add code
Dec 05, 2024
Figure 1 for Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection
Figure 2 for Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection
Figure 3 for Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection
Figure 4 for Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection
Viaarxiv icon

MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Add code
Dec 04, 2024
Viaarxiv icon

MV-Adapter: Multi-view Consistent Image Generation Made Easy

Add code
Dec 04, 2024
Figure 1 for MV-Adapter: Multi-view Consistent Image Generation Made Easy
Figure 2 for MV-Adapter: Multi-view Consistent Image Generation Made Easy
Figure 3 for MV-Adapter: Multi-view Consistent Image Generation Made Easy
Figure 4 for MV-Adapter: Multi-view Consistent Image Generation Made Easy
Viaarxiv icon