Xingyu Zhu

Adapting Point Cloud Analysis via Multimodal Bayesian Distribution Learning

Mar 23, 2026

Principled Steering via Null-space Projection for Jailbreak Defense in Vision-Language Models

Mar 23, 2026

MuSteerNet: Human Reaction Generation from Videos via Observation-Reaction Mutual Steering

Mar 20, 2026

Multimodal OCR: Parse Anything from Documents

Mar 13, 2026

Thinking with Images as Continuous Actions: Numerical Visual Chain-of-Thought

Feb 27, 2026

Look Carefully: Adaptive Visual Reinforcements in Multimodal Large Language Models for Hallucination Mitigation

Feb 27, 2026

GuardAlign: Test-time Safety Alignment in Multimodal Large Language Models

Feb 27, 2026

Revisiting Robustness for LLM Safety Alignment via Selective Geometry Control

Feb 07, 2026

Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening

Feb 05, 2026

Contextual Drag: How Errors in the Context Affect LLM Reasoning

Feb 04, 2026