Picture for Zhaoyuan Yang

Zhaoyuan Yang

Break the Brake, Not the Wheel: Untargeted Jailbreak via Entropy Maximization

Add code
May 11, 2026
Viaarxiv icon

All Roads Lead to Rome: Incentivizing Divergent Thinking in Vision-Language Models

Add code
Apr 01, 2026
Viaarxiv icon

Few Tokens Matter: Entropy Guided Attacks on Vision-Language Models

Add code
Dec 26, 2025
Viaarxiv icon

Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting

Add code
Oct 02, 2025
Figure 1 for Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting
Figure 2 for Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting
Figure 3 for Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting
Figure 4 for Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting
Viaarxiv icon

Probability Density Geodesics in Image Diffusion Latent Space

Add code
Apr 09, 2025
Figure 1 for Probability Density Geodesics in Image Diffusion Latent Space
Figure 2 for Probability Density Geodesics in Image Diffusion Latent Space
Figure 3 for Probability Density Geodesics in Image Diffusion Latent Space
Figure 4 for Probability Density Geodesics in Image Diffusion Latent Space
Viaarxiv icon

Identifying and Mitigating Position Bias of Multi-image Vision-Language Models

Add code
Mar 18, 2025
Figure 1 for Identifying and Mitigating Position Bias of Multi-image Vision-Language Models
Figure 2 for Identifying and Mitigating Position Bias of Multi-image Vision-Language Models
Figure 3 for Identifying and Mitigating Position Bias of Multi-image Vision-Language Models
Figure 4 for Identifying and Mitigating Position Bias of Multi-image Vision-Language Models
Viaarxiv icon

Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition

Add code
Feb 19, 2025
Figure 1 for Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition
Figure 2 for Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition
Figure 3 for Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition
Figure 4 for Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition
Viaarxiv icon

SimLabel: Consistency-Guided OOD Detection with Pretrained Vision-Language Models

Add code
Jan 20, 2025
Figure 1 for SimLabel: Consistency-Guided OOD Detection with Pretrained Vision-Language Models
Figure 2 for SimLabel: Consistency-Guided OOD Detection with Pretrained Vision-Language Models
Figure 3 for SimLabel: Consistency-Guided OOD Detection with Pretrained Vision-Language Models
Figure 4 for SimLabel: Consistency-Guided OOD Detection with Pretrained Vision-Language Models
Viaarxiv icon

DreamSteerer: Enhancing Source Image Conditioned Editability using Personalized Diffusion Models

Add code
Oct 15, 2024
Figure 1 for DreamSteerer: Enhancing Source Image Conditioned Editability using Personalized Diffusion Models
Figure 2 for DreamSteerer: Enhancing Source Image Conditioned Editability using Personalized Diffusion Models
Figure 3 for DreamSteerer: Enhancing Source Image Conditioned Editability using Personalized Diffusion Models
Figure 4 for DreamSteerer: Enhancing Source Image Conditioned Editability using Personalized Diffusion Models
Viaarxiv icon

ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models

Add code
Nov 27, 2023
Figure 1 for ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models
Figure 2 for ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models
Figure 3 for ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models
Figure 4 for ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models
Viaarxiv icon