Picture for Yucheng Shi

Yucheng Shi

Reasoning or Memorization? Direction-Aware Diversity Exploration in LLM Reinforcement Learning

Add code
Jun 09, 2026
Viaarxiv icon

Online Skill Learning for Web Agents via State-Grounded Dynamic Retrieval

Add code
Jun 03, 2026
Viaarxiv icon

TRON: Targeted Rule-Verifiable Online Environments for Visual Reasoning RL

Add code
Jun 01, 2026
Viaarxiv icon

Self-Improving Small Object Grounding in LVLMs

Add code
Jun 01, 2026
Viaarxiv icon

Learning to Build the Environment: Self-Evolving Reasoning RL via Verifiable Environment Synthesis

Add code
May 14, 2026
Viaarxiv icon

DeltaRubric: Generative Multimodal Reward Modeling via Joint Planning and Verification

Add code
May 10, 2026
Viaarxiv icon

Reinforcing Multimodal Reasoning Against Visual Degradation

Add code
May 10, 2026
Viaarxiv icon

Information Coordination as a Bridge: A Neuro-Symbolic Architecture for Reliable Autonomous Driving Scene Understanding

Add code
May 06, 2026
Viaarxiv icon

From Logs to Language: Learning Optimal Verbalization for LLM-Based Recommendation in Production

Add code
Feb 24, 2026
Viaarxiv icon

Automating Expert-Level Medical Reasoning Evaluation of Large Language Models

Add code
Jul 10, 2025
Viaarxiv icon