Xiao Tan

Combating Visual Neglect and Semantic Drift in Large Multimodal Models for Enhanced Cross-Modal Retrieval

Apr 28, 2026

LoReC: Rethinking Large Language Models for Graph Data Analysis

Apr 20, 2026

Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

Mar 19, 2026

Speed3R: Sparse Feed-forward 3D Reconstruction Models

Mar 09, 2026

From Intuition to Investigation: A Tool-Augmented Reasoning MLLM Framework for Generalizable Face Anti-Spoofing

Mar 01, 2026

ERNIE 5.0 Technical Report

Feb 04, 2026

LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation

Nov 09, 2025

AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving

Nov 09, 2025

VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving

Nov 09, 2025

Safe Navigation under State Uncertainty: Online Adaptation for Robust Control Barrier Functions

Aug 26, 2025