Picture for Jingang Wang

Jingang Wang

Dynamic Rollout Editing for Reducing Overthinking in RL-Trained Reasoning Models

Add code
Jun 16, 2026
Viaarxiv icon

HIPIF: Hierarchical Planning and Information Folding for Long-Horizon LLM Agent Learning

Add code
Jun 09, 2026
Viaarxiv icon

When Model Merging Breaks Routing: Training-Free Calibration for MoE

Add code
Jun 02, 2026
Viaarxiv icon

ATLAS: All-round Testing of Long-context Abilities across Scales

Add code
May 27, 2026
Viaarxiv icon

LANG: Reinforcement Learning for Multilingual Reasoning with Language-Adaptive Hint Guidance

Add code
May 21, 2026
Viaarxiv icon

MTR-Suite: A Framework for Evaluating and Synthesizing Conversational Retrieval Benchmarks

Add code
May 20, 2026
Viaarxiv icon

Teacher-Guided Policy Optimization for LLM Distillation

Add code
May 13, 2026
Viaarxiv icon

Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization

Add code
May 13, 2026
Viaarxiv icon

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Add code
Mar 22, 2026
Viaarxiv icon

OPE: Overcoming Information Saturation in Parallel Thinking via Outline-Guided Path Exploration

Add code
Feb 09, 2026
Viaarxiv icon