Jian Liang

Kwai Summary Attention Technical Report

Apr 27, 2026
Via arXiv

Understanding and Mitigating Spurious Signal Amplification in Test-Time Reinforcement Learning for Math Reasoning

Apr 23, 2026
Via arXiv

What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time

Mar 20, 2026
Via arXiv

Taming Momentum: Rethinking Optimizer States Through Low-Rank Approximation

Feb 27, 2026
Via arXiv

How to Train Your Deep Research Agent? Prompt, Reward, and Policy Optimization in Search-R1

Feb 23, 2026
Via arXiv

Mitigating the Safety-utility Trade-off in LLM Alignment via Adaptive Safe Context Learning

Feb 14, 2026
Via arXiv

Stop Tracking Me! Proactive Defense Against Attribute Inference Attack in LLMs

Feb 12, 2026
Via arXiv

Do MLLMs Really Understand Space? A Mathematical Reasoning Evaluation

Feb 12, 2026
Via arXiv

One Size, Many Fits: Aligning Diverse Group-Wise Click Preferences in Large-Scale Advertising Image Generation

Feb 02, 2026
Via arXiv

Learning Fair Domain Adaptation with Virtual Label Distribution

Jan 26, 2026
Via arXiv