Picture for Zhiyuan Xu

Zhiyuan Xu

FlowDepth: Decoupling Optical Flow for Self-Supervised Monocular Depth Estimation

Add code
Mar 28, 2024
Viaarxiv icon

Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models

Add code
Mar 15, 2024
Viaarxiv icon

A Survey on Robotics with Foundation Models: toward Embodied AI

Add code
Feb 04, 2024
Viaarxiv icon

Language-Conditioned Robotic Manipulation with Fast and Slow Thinking

Add code
Feb 01, 2024
Viaarxiv icon

Visual Robotic Manipulation with Depth-Aware Pretraining

Add code
Jan 17, 2024
Viaarxiv icon

SWBT: Similarity Weighted Behavior Transformer with the Imperfect Demonstration for Robotic Manipulation

Add code
Jan 17, 2024
Viaarxiv icon

An Efficient Generalizable Framework for Visuomotor Policies via Control-aware Augmentation and Privilege-guided Distillation

Add code
Jan 17, 2024
Viaarxiv icon

Object-Centric Instruction Augmentation for Robotic Manipulation

Add code
Jan 05, 2024
Figure 1 for Object-Centric Instruction Augmentation for Robotic Manipulation
Figure 2 for Object-Centric Instruction Augmentation for Robotic Manipulation
Figure 3 for Object-Centric Instruction Augmentation for Robotic Manipulation
Figure 4 for Object-Centric Instruction Augmentation for Robotic Manipulation
Viaarxiv icon

Multi-Clue Reasoning with Memory Augmentation for Knowledge-based Visual Question Answering

Add code
Dec 20, 2023
Viaarxiv icon

Cross-Modal Reasoning with Event Correlation for Video Question Answering

Add code
Dec 20, 2023
Viaarxiv icon