Picture for Qianchun Lu

Qianchun Lu

MoL-RL: Distilling Multi-Step Environmental Feedback into LLMs for Feedback-Independent Reasoning

Add code
Jul 27, 2025
Viaarxiv icon

MoL for LLMs: Dual-Loss Optimization to Enhance Domain Expertise While Preserving General Capabilities

Add code
May 17, 2025
Viaarxiv icon

Learning Like Humans: Advancing LLM Reasoning Capabilities via Adaptive Difficulty Curriculum Learning and Expert-Guided Self-Reformulation

Add code
May 13, 2025
Viaarxiv icon