Picture for Xiangnan He

Xiangnan He

Principled Steering via Null-space Projection for Jailbreak Defense in Vision-Language Models

Add code
Mar 23, 2026
Viaarxiv icon

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Add code
Mar 23, 2026
Viaarxiv icon

Breaking User-Centric Agency: A Tri-Party Framework for Agent-Based Recommendation

Add code
Mar 11, 2026
Viaarxiv icon

GuardAlign: Test-time Safety Alignment in Multimodal Large Language Models

Add code
Feb 27, 2026
Viaarxiv icon

Fine-grained Semantics Integration for Large Language Model-based Recommendation

Add code
Feb 26, 2026
Viaarxiv icon

Enhancing Multi-Modal LLMs Reasoning via Difficulty-Aware Group Normalization

Add code
Feb 26, 2026
Viaarxiv icon

Uncertainty-aware Generative Recommendation

Add code
Feb 12, 2026
Viaarxiv icon

Towards Sample-Efficient and Stable Reinforcement Learning for LLM-based Recommendation

Add code
Jan 31, 2026
Viaarxiv icon

UniGRec: Unified Generative Recommendation with Soft Identifiers for End-to-End Optimization

Add code
Jan 24, 2026
Viaarxiv icon

RL-MTJail: Reinforcement Learning for Automated Black-Box Multi-Turn Jailbreaking of Large Language Models

Add code
Dec 08, 2025
Viaarxiv icon