Rongtao Xu

$\mathcal{P}^3$: Toward Versatile Embodied Agents

Aug 09, 2025

3D-MoRe: Unified Modal-Contextual Reasoning for Embodied Question Answering

Jul 16, 2025

PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly

Jun 10, 2025

SAMamba: Adaptive State Space Modeling with Hierarchical Vision for Infrared Small Target Detection

May 29, 2025

FDBPL: Faster Distillation-Based Prompt Learning for Region-Aware Vision-Language Models Adaptation

May 23, 2025

Image Recognition with Online Lightweight Vision Transformer: A Survey

May 06, 2025

RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation

May 03, 2025

CAE-DFKD: Bridging the Transferability Gap in Data-Free Knowledge Distillation

Apr 30, 2025

A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation

Apr 21, 2025

Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition

Apr 14, 2025