Picture for Amy Zhang

Amy Zhang

Reinforcement Learning via Value Gradient Flow

Add code
Apr 15, 2026
Viaarxiv icon

Filtered Reasoning Score: Evaluating Reasoning Quality on a Model's Most-Confident Traces

Add code
Apr 13, 2026
Viaarxiv icon

The PokeAgent Challenge: Competitive and Long-Context Learning at Scale

Add code
Mar 17, 2026
Viaarxiv icon

Regularized Latent Dynamics Prediction is a Strong Baseline For Behavioral Foundation Models

Add code
Mar 16, 2026
Viaarxiv icon

A Recipe for Stable Offline Multi-agent Reinforcement Learning

Add code
Mar 09, 2026
Viaarxiv icon

Factored Latent Action World Models

Add code
Feb 18, 2026
Viaarxiv icon

Self-Refining Vision Language Model for Robotic Failure Detection and Reasoning

Add code
Feb 12, 2026
Viaarxiv icon

Hierarchical Entity-centric Reinforcement Learning with Factored Subgoal Diffusion

Add code
Feb 02, 2026
Viaarxiv icon

Learning Robust Reasoning through Guided Adversarial Self-Play

Add code
Jan 30, 2026
Viaarxiv icon

Multi-agent Coordination via Flow Matching

Add code
Nov 07, 2025
Viaarxiv icon