Picture for Cewu Lu

Cewu Lu

A Surprisingly Efficient Representation for Multi-Finger Grasping

Add code
Aug 05, 2024
Figure 1 for A Surprisingly Efficient Representation for Multi-Finger Grasping
Figure 2 for A Surprisingly Efficient Representation for Multi-Finger Grasping
Figure 3 for A Surprisingly Efficient Representation for Multi-Finger Grasping
Figure 4 for A Surprisingly Efficient Representation for Multi-Finger Grasping
Viaarxiv icon

Take A Step Back: Rethinking the Two Stages in Visual Reasoning

Add code
Jul 29, 2024
Figure 1 for Take A Step Back: Rethinking the Two Stages in Visual Reasoning
Figure 2 for Take A Step Back: Rethinking the Two Stages in Visual Reasoning
Figure 3 for Take A Step Back: Rethinking the Two Stages in Visual Reasoning
Figure 4 for Take A Step Back: Rethinking the Two Stages in Visual Reasoning
Viaarxiv icon

DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control

Add code
Jul 20, 2024
Figure 1 for DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control
Figure 2 for DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control
Figure 3 for DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control
Figure 4 for DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control
Viaarxiv icon

TieBot: Learning to Knot a Tie from Visual Demonstration through a Real-to-Sim-to-Real Approach

Add code
Jul 03, 2024
Figure 1 for TieBot: Learning to Knot a Tie from Visual Demonstration through a Real-to-Sim-to-Real Approach
Figure 2 for TieBot: Learning to Knot a Tie from Visual Demonstration through a Real-to-Sim-to-Real Approach
Figure 3 for TieBot: Learning to Knot a Tie from Visual Demonstration through a Real-to-Sim-to-Real Approach
Figure 4 for TieBot: Learning to Knot a Tie from Visual Demonstration through a Real-to-Sim-to-Real Approach
Viaarxiv icon

Human-Agent Joint Learning for Efficient Robot Manipulation Skill Acquisition

Add code
Jul 02, 2024
Figure 1 for Human-Agent Joint Learning for Efficient Robot Manipulation Skill Acquisition
Figure 2 for Human-Agent Joint Learning for Efficient Robot Manipulation Skill Acquisition
Figure 3 for Human-Agent Joint Learning for Efficient Robot Manipulation Skill Acquisition
Figure 4 for Human-Agent Joint Learning for Efficient Robot Manipulation Skill Acquisition
Viaarxiv icon

HumanVLA: Towards Vision-Language Directed Object Rearrangement by Physical Humanoid

Add code
Jun 28, 2024
Viaarxiv icon

Graspness Discovery in Clutters for Fast and Accurate Grasp Detection

Add code
Jun 17, 2024
Figure 1 for Graspness Discovery in Clutters for Fast and Accurate Grasp Detection
Figure 2 for Graspness Discovery in Clutters for Fast and Accurate Grasp Detection
Figure 3 for Graspness Discovery in Clutters for Fast and Accurate Grasp Detection
Figure 4 for Graspness Discovery in Clutters for Fast and Accurate Grasp Detection
Viaarxiv icon

Low-Rank Similarity Mining for Multimodal Dataset Distillation

Add code
Jun 06, 2024
Viaarxiv icon

The SkatingVerse Workshop & Challenge: Methods and Results

Add code
May 27, 2024
Figure 1 for The SkatingVerse Workshop & Challenge: Methods and Results
Figure 2 for The SkatingVerse Workshop & Challenge: Methods and Results
Viaarxiv icon

DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model

Add code
May 12, 2024
Figure 1 for DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Figure 2 for DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Figure 3 for DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Figure 4 for DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Viaarxiv icon