Picture for Yao Huang

Yao Huang

Breaking the Ceiling: Exploring the Potential of Jailbreak Attacks through Expanding Strategy Space

Add code
May 28, 2025
Viaarxiv icon

Mitigating Overthinking in Large Reasoning Models via Manifold Steering

Add code
May 28, 2025
Viaarxiv icon

Understanding Pre-training and Fine-tuning from Loss Landscape Perspectives

Add code
May 23, 2025
Viaarxiv icon

Decoupled Geometric Parameterization and its Application in Deep Homography Estimation

Add code
May 22, 2025
Viaarxiv icon

Towards NSFW-Free Text-to-Image Generation via Safety-Constraint Direct Preference Optimization

Add code
Apr 19, 2025
Viaarxiv icon

RealSafe-R1: Safety-Aligned DeepSeek-R1 without Compromising Reasoning Capability

Add code
Apr 14, 2025
Viaarxiv icon

When Lighting Deceives: Exposing Vision-Language Models' Illumination Vulnerability Through Illumination Transformation Attack

Add code
Mar 10, 2025
Viaarxiv icon

STAIR: Improving Safety Alignment with Introspective Reasoning

Add code
Feb 04, 2025
Figure 1 for STAIR: Improving Safety Alignment with Introspective Reasoning
Figure 2 for STAIR: Improving Safety Alignment with Introspective Reasoning
Figure 3 for STAIR: Improving Safety Alignment with Introspective Reasoning
Figure 4 for STAIR: Improving Safety Alignment with Introspective Reasoning
Viaarxiv icon

PaMMA-Net: Plasmas magnetic measurement evolution based on data-driven incremental accumulative prediction

Add code
Jan 23, 2025
Viaarxiv icon

AdvDreamer Unveils: Are Vision-Language Models Truly Ready for Real-World 3D Variations?

Add code
Dec 04, 2024
Figure 1 for AdvDreamer Unveils: Are Vision-Language Models Truly Ready for Real-World 3D Variations?
Figure 2 for AdvDreamer Unveils: Are Vision-Language Models Truly Ready for Real-World 3D Variations?
Figure 3 for AdvDreamer Unveils: Are Vision-Language Models Truly Ready for Real-World 3D Variations?
Figure 4 for AdvDreamer Unveils: Are Vision-Language Models Truly Ready for Real-World 3D Variations?
Viaarxiv icon