Picture for Sizhe Tang

Sizhe Tang

Reason in Chains, Learn in Trees: Self-Rectification and Grafting for Multi-turn Agent Policy Optimization

Add code
Apr 08, 2026
Viaarxiv icon

IntentScore: Intent-Conditioned Action Evaluation for Computer-Use Agents

Add code
Apr 06, 2026
Viaarxiv icon

Cochain Perspectives on Temporal-Difference Signals for Learning Beyond Markov Dynamics

Add code
Feb 06, 2026
Viaarxiv icon

Agent Alpha: Tree Search Unifying Generation, Exploration and Evaluation for Computer-Use Agents

Add code
Feb 03, 2026
Viaarxiv icon

ACDZero: Graph-Embedding-Based Tree Search for Mastering Automated Cyber Defense

Add code
Jan 05, 2026
Viaarxiv icon

MALinZero: Efficient Low-Dimensional Search for Mastering Complex Multi-Agent Planning

Add code
Nov 08, 2025
Viaarxiv icon