Xijun Wang

AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models

Jun 16, 2024

Deep Stochastic Kinematic Models for Probabilistic Motion Forecasting in Traffic

Jun 03, 2024

AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying Scales

Apr 04, 2024

A General Method to Incorporate Spatial Information into Loss Functions for GAN-based Super-resolution Models

Mar 15, 2024

Real-World Atmospheric Turbulence Correction via Domain Adaptation

Feb 12, 2024

Adv-Diffusion: Imperceptible Adversarial Face Identity Attack via Latent Diffusion Model

Dec 28, 2023

VLAP: Efficient Video-Language Alignment via Frame Prompting and Distilling for Video Question Answering

Dec 13, 2023

Optimal Status Updates for Minimizing Age of Correlated Information in IoT Networks with Energy Harvesting Sensors

Oct 30, 2023

Foundation Model Based Native AI Framework in 6G with Cloud-Edge-End Collaboration

Oct 26, 2023

ICAR: Image-based Complementary Auto Reasoning

Aug 17, 2023