Xiaolu Zhang

The Curse of Helpfulness: Inverse Scaling Law in Robustness to Distractor Instructions via DistractionIF

May 28, 2026
Via arXiv

Source-Grounded Semantic Reinforcement Learning for Low-Resource Target-Language Generation

May 28, 2026
Via arXiv

Focal Reward: Balanced Reinforcement Learning under Rubric-Based Rewards

May 26, 2026
Via arXiv

Reinforcement Learning with Semantic Rewards Enables Low-Resource Language Expansion without Alignment Tax

May 14, 2026
Via arXiv

Deterministic Differentiable Structured Pruning for Large Language Models

Mar 09, 2026
Via arXiv

LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model

Mar 01, 2026
Via arXiv

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Feb 09, 2026
Via arXiv

Per-parameter Task Arithmetic for Unlearning in Large Language Models

Jan 29, 2026
Via arXiv

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

Dec 24, 2025
Via arXiv

EVOREFUSE: Evolutionary Prompt Optimization for Evaluation and Mitigation of LLM Over-Refusal to Pseudo-Malicious Instructions

May 29, 2025
Via arXiv