Picture for Xingyu Zhu

Xingyu Zhu

Principled Steering via Null-space Projection for Jailbreak Defense in Vision-Language Models

Add code
Mar 23, 2026
Viaarxiv icon

Adapting Point Cloud Analysis via Multimodal Bayesian Distribution Learning

Add code
Mar 23, 2026
Viaarxiv icon

MuSteerNet: Human Reaction Generation from Videos via Observation-Reaction Mutual Steering

Add code
Mar 20, 2026
Viaarxiv icon

Multimodal OCR: Parse Anything from Documents

Add code
Mar 13, 2026
Viaarxiv icon

GuardAlign: Test-time Safety Alignment in Multimodal Large Language Models

Add code
Feb 27, 2026
Viaarxiv icon

Look Carefully: Adaptive Visual Reinforcements in Multimodal Large Language Models for Hallucination Mitigation

Add code
Feb 27, 2026
Viaarxiv icon

Thinking with Images as Continuous Actions: Numerical Visual Chain-of-Thought

Add code
Feb 27, 2026
Viaarxiv icon

Revisiting Robustness for LLM Safety Alignment via Selective Geometry Control

Add code
Feb 07, 2026
Viaarxiv icon

Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening

Add code
Feb 05, 2026
Viaarxiv icon

Contextual Drag: How Errors in the Context Affect LLM Reasoning

Add code
Feb 04, 2026
Viaarxiv icon