Xi Xiao

4D Multimodal Co-attention Fusion Network with Latent Contrastive Alignment for Alzheimer's Diagnosis

Apr 23, 2025

TWIG: Two-Step Image Generation using Segmentation Masks in Diffusion Models

Apr 21, 2025

CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization

Mar 31, 2025

MagicID: Hybrid Preference Optimization for ID-Consistent and Dynamic-Preserved Video Customization

Mar 16, 2025

CAD-VAE: Leveraging Correlation-Aware Latents for Comprehensive Fair Disentanglement

Mar 11, 2025

TD-RD: A Top-Down Benchmark with Real-Time Framework for Road Damage Detection

Jan 24, 2025

Revolutionizing Encrypted Traffic Classification with MH-Net: A Multi-View Heterogeneous Graph Model

Jan 05, 2025

Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs

Jan 02, 2025

Query-Efficient Video Adversarial Attack with Stylized Logo

Aug 22, 2024

HGTDP-DTA: Hybrid Graph-Transformer with Dynamic Prompt for Drug-Target Binding Affinity Prediction

Jun 25, 2024