Picture for Jiang Bian

Jiang Bian

Beyond Pixel Histories: World Models with Persistent 3D State

Add code
Mar 03, 2026
Viaarxiv icon

Reasoning as Gradient: Scaling MLE Agents Beyond Tree Search

Add code
Mar 02, 2026
Viaarxiv icon

FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents

Add code
Mar 02, 2026
Viaarxiv icon

Can Vision Language Models Assess Graphic Design Aesthetics? A Benchmark, Evaluation, and Dataset Perspective

Add code
Mar 01, 2026
Viaarxiv icon

Closing Reasoning Gaps in Clinical Agents with Differential Reasoning Learning

Add code
Feb 10, 2026
Viaarxiv icon

SceneReVis: A Self-Reflective Vision-Grounded Framework for 3D Indoor Scene Synthesis via Multi-turn RL

Add code
Feb 10, 2026
Viaarxiv icon

Self-Hinting Language Models Enhance Reinforcement Learning

Add code
Feb 03, 2026
Viaarxiv icon

RC-GRPO: Reward-Conditioned Group Relative Policy Optimization for Multi-Turn Tool Calling Agents

Add code
Feb 03, 2026
Viaarxiv icon

LIVE: Long-horizon Interactive Video World Modeling

Add code
Feb 03, 2026
Viaarxiv icon

ReLayout: Versatile and Structure-Preserving Design Layout Editing via Relation-Aware Design Reconstruction

Add code
Feb 01, 2026
Viaarxiv icon