Picture for Haoran Xu

Haoran Xu

Visual Para-Thinker++: A Single-Policy Multi-Agent Framework for Visual Reasoning

Add code
Jun 08, 2026
Viaarxiv icon

SG-OPD: Sign-Gated On-Policy Distillation via Sign-Consistency Gating and Phased Teacher Sampling

Add code
Jun 08, 2026
Viaarxiv icon

Trajectory-Refined Distillation

Add code
Jun 07, 2026
Viaarxiv icon

Hierarchical Certified Semantic Commitment for Byzantine-Resilient LLM-Agent Collaboration

Add code
Jun 05, 2026
Viaarxiv icon

Cherry-pick Override: Unsafe Directional Commitment in LLM Judges under Mixed Evidence

Add code
Jun 05, 2026
Viaarxiv icon

TVIR: Building Deep Research Agents Towards Text--Visual Interleaved Report Generation

Add code
Jun 01, 2026
Viaarxiv icon

SCOPE: Simulating Cross-game Operations in Playable Environments for FPS World Models

Add code
May 28, 2026
Viaarxiv icon

MedMemoryBench: Benchmarking Agent Memory in Personalized Healthcare

Add code
May 12, 2026
Viaarxiv icon

Reinforcement Learning via Value Gradient Flow

Add code
Apr 15, 2026
Viaarxiv icon

Beyond Monologue: Interactive Talking-Listening Avatar Generation with Conversational Audio Context-Aware Kernels

Add code
Apr 11, 2026
Viaarxiv icon