Picture for Bo Zheng

Bo Zheng

additional authors not shown

SkillChain: Closing the Loop on Skill Evolution for Image-Based E-Commerce AI Assistants

Add code
Jun 11, 2026
Viaarxiv icon

Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling

Add code
Jun 10, 2026
Viaarxiv icon

ReCal: Reward Calibration for RL-based LLM Routing

Add code
Jun 10, 2026
Viaarxiv icon

How Does Reasoning Flow? Tracing Attention-Induced Information Flow for Targeted RL in LLMs

Add code
Jun 09, 2026
Viaarxiv icon

TVIR: Building Deep Research Agents Towards Text--Visual Interleaved Report Generation

Add code
Jun 01, 2026
Viaarxiv icon

Uniboost: Global Coordination with Value Alignment for Fair and Efficient Traffic Allocation

Add code
May 26, 2026
Viaarxiv icon

Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality?

Add code
May 21, 2026
Viaarxiv icon

iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance

Add code
May 20, 2026
Viaarxiv icon

Continuous-Time Distribution Matching for Few-Step Diffusion Distillation

Add code
May 07, 2026
Viaarxiv icon

DDA-Thinker: Decoupled Dual-Atomic Reinforcement Learning for Reasoning-Driven Image Editing

Add code
Apr 28, 2026
Viaarxiv icon