Picture for Xingtao Wang

Xingtao Wang

Towards Accurate Single Panoramic 3D Detection: A Semantic Gaussian Centric Approach

Add code
May 14, 2026
Viaarxiv icon

PairDropGS: Paired Dropout-Induced Consistency Regularization for Sparse-View Gaussian Splatting

Add code
May 13, 2026
Viaarxiv icon

Hyperbolic Distillation: Geometry-Guided Cross-Modal Transfer for Robust 3D Object Detection

Add code
May 11, 2026
Viaarxiv icon

MRT: Learning Compact Representations with Mixed RWKV-Transformer for Extreme Image Compression

Add code
Nov 14, 2025
Viaarxiv icon

An End-to-End Room Geometry Constrained Depth Estimation Framework for Indoor Panorama Images

Add code
Oct 09, 2025
Viaarxiv icon

Region-Level Context-Aware Multimodal Understanding

Add code
Aug 17, 2025
Viaarxiv icon

T-GVC: Trajectory-Guided Generative Video Coding at Ultra-Low Bitrates

Add code
Jul 10, 2025
Figure 1 for T-GVC: Trajectory-Guided Generative Video Coding at Ultra-Low Bitrates
Figure 2 for T-GVC: Trajectory-Guided Generative Video Coding at Ultra-Low Bitrates
Figure 3 for T-GVC: Trajectory-Guided Generative Video Coding at Ultra-Low Bitrates
Figure 4 for T-GVC: Trajectory-Guided Generative Video Coding at Ultra-Low Bitrates
Viaarxiv icon

Multi-Timescale Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning

Add code
May 26, 2025
Figure 1 for Multi-Timescale Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning
Figure 2 for Multi-Timescale Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning
Figure 3 for Multi-Timescale Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning
Figure 4 for Multi-Timescale Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning
Viaarxiv icon

RouteWinFormer: A Route-Window Transformer for Middle-range Attention in Image Restoration

Add code
Apr 23, 2025
Viaarxiv icon

FLAM: Foundation Model-Based Body Stabilization for Humanoid Locomotion and Manipulation

Add code
Mar 28, 2025
Viaarxiv icon