Jian Liang

What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time

Mar 20, 2026
Via arXiv

Taming Momentum: Rethinking Optimizer States Through Low-Rank Approximation

Feb 27, 2026
Via arXiv

How to Train Your Deep Research Agent? Prompt, Reward, and Policy Optimization in Search-R1

Feb 23, 2026
Via arXiv

Mitigating the Safety-utility Trade-off in LLM Alignment via Adaptive Safe Context Learning

Feb 14, 2026
Via arXiv

Stop Tracking Me! Proactive Defense Against Attribute Inference Attack in LLMs

Feb 12, 2026
Via arXiv

Do MLLMs Really Understand Space? A Mathematical Reasoning Evaluation

Feb 12, 2026
Via arXiv

One Size, Many Fits: Aligning Diverse Group-Wise Click Preferences in Large-Scale Advertising Image Generation

Feb 02, 2026
Via arXiv

Learning Fair Domain Adaptation with Virtual Label Distribution

Jan 26, 2026
Via arXiv

TAT: Task-Adaptive Transformer for All-in-One Medical Image Restoration

Dec 16, 2025
Via arXiv

Reassessing the Role of Supervised Fine-Tuning: An Empirical Study in VLM Reasoning

Dec 14, 2025
Via arXiv