Picture for Lidong Bing

Lidong Bing

MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier

Add code
Mar 04, 2026
Viaarxiv icon

LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards

Add code
Mar 02, 2026
Viaarxiv icon

MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks

Add code
Feb 26, 2026
Viaarxiv icon

Document Reconstruction Unlocks Scalable Long-Context RLVR

Add code
Feb 09, 2026
Viaarxiv icon

Self-Rewarding Sequential Monte Carlo for Masked Diffusion Language Models

Add code
Feb 02, 2026
Viaarxiv icon

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Add code
Jan 14, 2026
Viaarxiv icon

EverMemOS: A Self-Organizing Memory Operating System for Structured Long-Horizon Reasoning

Add code
Jan 05, 2026
Viaarxiv icon

On the Role of Discreteness in Diffusion LLMs

Add code
Dec 27, 2025
Viaarxiv icon

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Add code
Nov 18, 2025
Viaarxiv icon

Multi-Agent Tool-Integrated Policy Optimization

Add code
Oct 06, 2025
Figure 1 for Multi-Agent Tool-Integrated Policy Optimization
Figure 2 for Multi-Agent Tool-Integrated Policy Optimization
Figure 3 for Multi-Agent Tool-Integrated Policy Optimization
Figure 4 for Multi-Agent Tool-Integrated Policy Optimization
Viaarxiv icon