Picture for Shengjie Wang

Shengjie Wang

Hindsight Hint Distillation: Scaffolded Reasoning for SWE Agents from CoT-free Answers

Add code
May 12, 2026
Viaarxiv icon

Parallel Prefix Verification for Speculative Generation

Add code
May 05, 2026
Viaarxiv icon

Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

Add code
Apr 06, 2026
Viaarxiv icon

InfoFlow KV: Information-Flow-Aware KV Recomputation for Long Context

Add code
Mar 05, 2026
Viaarxiv icon

Rooted Absorbed Prefix Trajectory Balance with Submodular Replay for GFlowNet Training

Add code
Feb 28, 2026
Viaarxiv icon

Affordance-Aware Interactive Decision-Making and Execution for Ambiguous Instructions

Add code
Feb 05, 2026
Viaarxiv icon

Kimi K2.5: Visual Agentic Intelligence

Add code
Feb 02, 2026
Viaarxiv icon

Nightmare Dreamer: Dreaming About Unsafe States And Planning Ahead

Add code
Jan 08, 2026
Viaarxiv icon

Translating Flow to Policy via Hindsight Online Imitation

Add code
Dec 22, 2025
Viaarxiv icon

Small Drafts, Big Verdict: Information-Intensive Visual Reasoning via Speculation

Add code
Oct 23, 2025
Viaarxiv icon