Picture for Xiao Wang

Xiao Wang

School of Computer and Information, Hefei University of Technology, China

TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection

Add code
Mar 18, 2025
Viaarxiv icon

DehazeMamba: SAR-guided Optical Remote Sensing Image Dehazing with Adaptive State Space Model

Add code
Mar 17, 2025
Figure 1 for DehazeMamba: SAR-guided Optical Remote Sensing Image Dehazing with Adaptive State Space Model
Figure 2 for DehazeMamba: SAR-guided Optical Remote Sensing Image Dehazing with Adaptive State Space Model
Figure 3 for DehazeMamba: SAR-guided Optical Remote Sensing Image Dehazing with Adaptive State Space Model
Figure 4 for DehazeMamba: SAR-guided Optical Remote Sensing Image Dehazing with Adaptive State Space Model
Viaarxiv icon

AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding

Add code
Mar 16, 2025
Viaarxiv icon

CAD-VAE: Leveraging Correlation-Aware Latents for Comprehensive Fair Disentanglement

Add code
Mar 11, 2025
Figure 1 for CAD-VAE: Leveraging Correlation-Aware Latents for Comprehensive Fair Disentanglement
Figure 2 for CAD-VAE: Leveraging Correlation-Aware Latents for Comprehensive Fair Disentanglement
Figure 3 for CAD-VAE: Leveraging Correlation-Aware Latents for Comprehensive Fair Disentanglement
Figure 4 for CAD-VAE: Leveraging Correlation-Aware Latents for Comprehensive Fair Disentanglement
Viaarxiv icon

Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection

Add code
Mar 10, 2025
Figure 1 for Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection
Figure 2 for Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection
Figure 3 for Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection
Figure 4 for Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection
Viaarxiv icon

Sign Language Translation using Frame and Event Stream: Benchmark Dataset and Algorithms

Add code
Mar 09, 2025
Viaarxiv icon

AutoMisty: A Multi-Agent LLM Framework for Automated Code Generation in the Misty Social Robot

Add code
Mar 09, 2025
Viaarxiv icon

Biomedical Foundation Model: A Survey

Add code
Mar 03, 2025
Viaarxiv icon

HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models

Add code
Feb 28, 2025
Figure 1 for HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models
Figure 2 for HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models
Figure 3 for HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models
Figure 4 for HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models
Viaarxiv icon

Langevin Multiplicative Weights Update with Applications in Polynomial Portfolio Management

Add code
Feb 26, 2025
Figure 1 for Langevin Multiplicative Weights Update with Applications in Polynomial Portfolio Management
Figure 2 for Langevin Multiplicative Weights Update with Applications in Polynomial Portfolio Management
Figure 3 for Langevin Multiplicative Weights Update with Applications in Polynomial Portfolio Management
Figure 4 for Langevin Multiplicative Weights Update with Applications in Polynomial Portfolio Management
Viaarxiv icon