Kai Tang

STAR: Stage-Wise Attention-Guided Token Reduction for Efficient Large Vision-Language Models Inference

May 18, 2025
Via arXiv

Mitigating Hallucinations via Inter-Layer Consistency Aggregation in Large Vision-Language Models

May 18, 2025
Via arXiv

Beyond Fixed Variables: Expanding-variate Time Series Forecasting via Flat Scheme and Spatio-temporal Focal Learning

Feb 21, 2025
Via arXiv

Monocular Event-Inertial Odometry with Adaptive decay-based Time Surface and Polarity-aware Tracking

Sep 21, 2024
Via arXiv

TDT Loss Takes It All: Integrating Temporal Dependencies among Targets into Non-Autoregressive Time Series Forecasting

Jun 07, 2024
Via arXiv

ChangeAnywhere: Sample Generation for Remote Sensing Change Detection via Semantic Latent Diffusion Model

Apr 13, 2024
Via arXiv

SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking

Mar 28, 2024
Via arXiv

Online Robot Navigation and Manipulation with Distilled Vision-Language Models

Jan 30, 2024
Via arXiv

Generalized Label-Efficient 3D Scene Parsing via Hierarchical Feature Aligned Pre-Training and Region-Aware Fine-tuning

Dec 01, 2023
Via arXiv

Coco-LIC: Continuous-Time Tightly-Coupled LiDAR-Inertial-Camera Odometry using Non-Uniform B-spline

Sep 18, 2023
Via arXiv