Picture for Bing Liu

Bing Liu

Jack

Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR

Add code
May 19, 2026
Viaarxiv icon

Reward Hacking in Rubric-Based Reinforcement Learning

Add code
May 12, 2026
Viaarxiv icon

HiL-Bench (Human-in-Loop Benchmark): Do Agents Know When to Ask for Help?

Add code
Apr 13, 2026
Viaarxiv icon

SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences?

Add code
Apr 12, 2026
Viaarxiv icon

MCP-Atlas: A Large-Scale Benchmark for Tool-Use Competency with Real MCP Servers

Add code
Jan 31, 2026
Viaarxiv icon

Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding

Add code
Jan 28, 2026
Viaarxiv icon

Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision

Add code
Jan 27, 2026
Viaarxiv icon

Cross-Session Decoding of Neural Spiking Data via Task-Conditioned Latent Alignment

Add code
Jan 27, 2026
Viaarxiv icon

Continual Learning of Achieving Forgetting-free and Positive Knowledge Transfer

Add code
Jan 09, 2026
Viaarxiv icon

Agentic Rubrics as Contextual Verifiers for SWE Agents

Add code
Jan 07, 2026
Viaarxiv icon