Picture for Wenke Xia

Wenke Xia

Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction

Add code
Apr 20, 2025
Viaarxiv icon

Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection

Add code
Aug 09, 2024
Figure 1 for Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection
Figure 2 for Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection
Figure 3 for Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection
Figure 4 for Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection
Viaarxiv icon

KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance

Add code
Aug 06, 2024
Figure 1 for KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance
Figure 2 for KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance
Figure 3 for KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance
Figure 4 for KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance
Viaarxiv icon

Learning Manipulation by Predicting Interaction

Add code
Jun 01, 2024
Figure 1 for Learning Manipulation by Predicting Interaction
Figure 2 for Learning Manipulation by Predicting Interaction
Figure 3 for Learning Manipulation by Predicting Interaction
Figure 4 for Learning Manipulation by Predicting Interaction
Viaarxiv icon

SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation

Add code
May 30, 2024
Figure 1 for SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
Figure 2 for SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
Figure 3 for SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
Figure 4 for SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
Viaarxiv icon

Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs

Add code
Nov 08, 2023
Figure 1 for Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs
Figure 2 for Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs
Figure 3 for Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs
Figure 4 for Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs
Viaarxiv icon

Robust Cross-Modal Knowledge Distillation for Unconstrained Videos

Add code
Apr 27, 2023
Viaarxiv icon

Revisiting Pre-training in Audio-Visual Learning

Add code
Feb 17, 2023
Viaarxiv icon

Balanced Audiovisual Dataset for Imbalance Analysis

Add code
Feb 14, 2023
Figure 1 for Balanced Audiovisual Dataset for Imbalance Analysis
Figure 2 for Balanced Audiovisual Dataset for Imbalance Analysis
Figure 3 for Balanced Audiovisual Dataset for Imbalance Analysis
Figure 4 for Balanced Audiovisual Dataset for Imbalance Analysis
Viaarxiv icon

TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat

Add code
Jan 14, 2023
Viaarxiv icon