Picture for Xing Wei

Xing Wei

ABot-Claw: A Foundation for Persistent, Cooperative, and Self-Evolving Robotic Agents

Add code
Apr 11, 2026
Viaarxiv icon

Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing

Add code
Mar 25, 2026
Viaarxiv icon

ABot-PhysWorld: Interactive World Foundation Model for Robotic Manipulation with Physics Alignment

Add code
Mar 24, 2026
Viaarxiv icon

ABot-M0: VLA Foundation Model for Robotic Manipulation with Action Manifold Learning

Add code
Feb 11, 2026
Viaarxiv icon

FARTrack: Fast Autoregressive Visual Tracking with High Performance

Add code
Feb 03, 2026
Viaarxiv icon

JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation

Add code
Sep 26, 2025
Figure 1 for JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Figure 2 for JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Figure 3 for JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Figure 4 for JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Viaarxiv icon

GPG-HT: Generalized Policy Gradient with History-Aware Decision Transformer for Probabilistic Path Planning

Add code
Aug 24, 2025
Viaarxiv icon

FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving

Add code
May 23, 2025
Viaarxiv icon

TACOcc:Target-Adaptive Cross-Modal Fusion with Volume Rendering for 3D Semantic Occupancy

Add code
May 19, 2025
Figure 1 for TACOcc:Target-Adaptive Cross-Modal Fusion with Volume Rendering for 3D Semantic Occupancy
Figure 2 for TACOcc:Target-Adaptive Cross-Modal Fusion with Volume Rendering for 3D Semantic Occupancy
Figure 3 for TACOcc:Target-Adaptive Cross-Modal Fusion with Volume Rendering for 3D Semantic Occupancy
Figure 4 for TACOcc:Target-Adaptive Cross-Modal Fusion with Volume Rendering for 3D Semantic Occupancy
Viaarxiv icon

VGAT: A Cancer Survival Analysis Framework Transitioning from Generative Visual Question Answering to Genomic Reconstruction

Add code
Mar 25, 2025
Viaarxiv icon