Bo An

Understanding Diversity Collapse in RLVR via the Lens of Overtraining

Jun 13, 2026

Science Earth: Towards A Planet-Scale Operating System for AI-Native Scientific Discovery

May 31, 2026

Adversarial Dual On-Policy Distillation from Expressive Flow-based Teacher

May 26, 2026

Beyond Trajectory-Level Attribution: Graph-Based Credit Assignment for Agentic Reinforcement Learning

May 26, 2026

AutoDFT: A Closed-Loop Multi-Agent Framework for Autonomous DFT Calculations

May 25, 2026

How Mobile World Model Guides GUI Agents?

May 11, 2026

Self-Debias: Self-correcting for Debiasing Large Language Models

Apr 09, 2026

LBM: Hierarchical Large Auto-Bidding Model via Reasoning and Acting

Mar 05, 2026

SPARC: Spatial-Aware Path Planning via Attentive Robot Communication

Mar 03, 2026

Hierarchy-of-Groups Policy Optimization for Long-Horizon Agentic Tasks

Feb 26, 2026