Picture for Linyang Li

Linyang Li

What and When to Distill: Selective Hindsight Distillation for Multi-Turn Agents

Add code
May 19, 2026
Viaarxiv icon

Beyond Mode Collapse: Distribution Matching for Diverse Reasoning

Add code
May 19, 2026
Viaarxiv icon

COSMO-Agent: Tool-Augmented Agent for Closed-loop Optimization,Simulation,and Modeling Orchestration

Add code
Apr 07, 2026
Viaarxiv icon

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Add code
Mar 26, 2026
Viaarxiv icon

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Add code
Jan 23, 2026
Viaarxiv icon

Mixing Expert Knowledge: Bring Human Thoughts Back To the Game of Go

Add code
Jan 23, 2026
Viaarxiv icon

Timely Machine: Awareness of Time Makes Test-Time Scaling Agentic

Add code
Jan 23, 2026
Viaarxiv icon

MedCalc-Eval and MedCalc-Env: Advancing Medical Calculation Capabilities of Large Language Models

Add code
Oct 31, 2025
Viaarxiv icon

InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling

Add code
Aug 12, 2025
Figure 1 for InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling
Figure 2 for InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling
Figure 3 for InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling
Figure 4 for InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling
Viaarxiv icon

UnitCoder: Scalable Iterative Code Synthesis with Unit Test Guidance

Add code
Feb 17, 2025
Viaarxiv icon