Picture for Xing Wei

Xing Wei

Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing

Add code
Mar 25, 2026
Viaarxiv icon

ABot-PhysWorld: Interactive World Foundation Model for Robotic Manipulation with Physics Alignment

Add code
Mar 24, 2026
Viaarxiv icon

ABot-M0: VLA Foundation Model for Robotic Manipulation with Action Manifold Learning

Add code
Feb 11, 2026
Viaarxiv icon

FARTrack: Fast Autoregressive Visual Tracking with High Performance

Add code
Feb 03, 2026
Viaarxiv icon

JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation

Add code
Sep 26, 2025
Figure 1 for JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Figure 2 for JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Figure 3 for JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Figure 4 for JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Viaarxiv icon

GPG-HT: Generalized Policy Gradient with History-Aware Decision Transformer for Probabilistic Path Planning

Add code
Aug 24, 2025
Viaarxiv icon

FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving

Add code
May 23, 2025
Viaarxiv icon

TACOcc:Target-Adaptive Cross-Modal Fusion with Volume Rendering for 3D Semantic Occupancy

Add code
May 19, 2025
Figure 1 for TACOcc:Target-Adaptive Cross-Modal Fusion with Volume Rendering for 3D Semantic Occupancy
Figure 2 for TACOcc:Target-Adaptive Cross-Modal Fusion with Volume Rendering for 3D Semantic Occupancy
Figure 3 for TACOcc:Target-Adaptive Cross-Modal Fusion with Volume Rendering for 3D Semantic Occupancy
Figure 4 for TACOcc:Target-Adaptive Cross-Modal Fusion with Volume Rendering for 3D Semantic Occupancy
Viaarxiv icon

VGAT: A Cancer Survival Analysis Framework Transitioning from Generative Visual Question Answering to Genomic Reconstruction

Add code
Mar 25, 2025
Viaarxiv icon

Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution

Add code
Dec 04, 2024
Figure 1 for Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution
Figure 2 for Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution
Figure 3 for Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution
Figure 4 for Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution
Viaarxiv icon