Picture for Lifeng Shang

Lifeng Shang

Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification

Add code
Jun 05, 2025
Viaarxiv icon

Pangu DeepDiver: Adaptive Search Intensity Scaling via Open-Web Reinforcement Learning

Add code
May 30, 2025
Viaarxiv icon

Self-Error-Instruct: Generalizing from Errors for LLMs Mathematical Reasoning

Add code
May 28, 2025
Viaarxiv icon

Stepwise Reasoning Checkpoint Analysis: A Test Time Scaling Method to Enhance LLMs' Reasoning

Add code
May 23, 2025
Viaarxiv icon

The Real Barrier to LLM Agent Usability is Agentic ROI

Add code
May 23, 2025
Viaarxiv icon

How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study

Add code
May 21, 2025
Viaarxiv icon

Instruction-Tuning Data Synthesis from Scratch via Web Reconstruction

Add code
Apr 22, 2025
Viaarxiv icon

ToolACE-R: Tool Learning with Adaptive Self-Refinement

Add code
Apr 02, 2025
Viaarxiv icon

DAST: Difficulty-Aware Self-Training on Large Language Models

Add code
Mar 12, 2025
Viaarxiv icon

Learning to Align Multi-Faceted Evaluation: A Unified and Robust Framework

Add code
Feb 26, 2025
Viaarxiv icon