Rongtao Xu

CurriFlow: Curriculum-Guided Depth Fusion with Optical Flow-Based Temporal Alignment for 3D Semantic Scene Completion

Oct 14, 2025
Via arXiv

ActiveVLN: Towards Active Exploration via Multi-Turn RL in Vision-and-Language Navigation

Sep 16, 2025
Via arXiv

$\mathcal{P}^3$: Toward Versatile Embodied Agents

Aug 09, 2025
Via arXiv

3D-MoRe: Unified Modal-Contextual Reasoning for Embodied Question Answering

Jul 16, 2025
Via arXiv

PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly

Jun 10, 2025
Via arXiv

SAMamba: Adaptive State Space Modeling with Hierarchical Vision for Infrared Small Target Detection

May 29, 2025
Via arXiv

FDBPL: Faster Distillation-Based Prompt Learning for Region-Aware Vision-Language Models Adaptation

May 23, 2025
Via arXiv

Image Recognition with Online Lightweight Vision Transformer: A Survey

May 06, 2025
Via arXiv

RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation

May 03, 2025
Via arXiv

CAE-DFKD: Bridging the Transferability Gap in Data-Free Knowledge Distillation

Apr 30, 2025
Via arXiv