Picture for Yanyuan Qiao

Yanyuan Qiao

Chain-of-Action: Trajectory Autoregressive Modeling for Robotic Manipulation

Add code
Jun 11, 2025
Viaarxiv icon

BadNAVer: Exploring Jailbreak Attacks On Vision-and-Language Navigation

Add code
May 18, 2025
Viaarxiv icon

COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation

Add code
Mar 31, 2025
Viaarxiv icon

FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks

Add code
Mar 18, 2025
Figure 1 for FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks
Figure 2 for FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks
Figure 3 for FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks
Figure 4 for FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks
Viaarxiv icon

SmartWay: Enhanced Waypoint Prediction and Backtracking for Zero-Shot Vision-and-Language Navigation

Add code
Mar 13, 2025
Viaarxiv icon

Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments

Add code
Feb 26, 2025
Figure 1 for Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments
Figure 2 for Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments
Figure 3 for Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments
Figure 4 for Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments
Viaarxiv icon

General Scene Adaptation for Vision-and-Language Navigation

Add code
Jan 29, 2025
Figure 1 for General Scene Adaptation for Vision-and-Language Navigation
Figure 2 for General Scene Adaptation for Vision-and-Language Navigation
Figure 3 for General Scene Adaptation for Vision-and-Language Navigation
Figure 4 for General Scene Adaptation for Vision-and-Language Navigation
Viaarxiv icon

MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation

Add code
Oct 02, 2024
Viaarxiv icon

Effective Tuning Strategies for Generalist Robot Manipulation Policies

Add code
Oct 02, 2024
Viaarxiv icon

MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation

Add code
Sep 27, 2024
Figure 1 for MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation
Figure 2 for MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation
Figure 3 for MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation
Figure 4 for MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation
Viaarxiv icon