Henry Peng Zou

Toward User Preference Alignment in LLM Recommendation via Explicit Context Feedback

May 27, 2026
Via arXiv

Resolving Action Bottleneck: Agentic Reinforcement Learning Informed by Token-Level Energy

May 14, 2026
Via arXiv

GAM: Hierarchical Graph-based Agentic Memory for LLM Agents

Apr 14, 2026
Via arXiv

Unveiling Language Routing Isolation in Multilingual MoE Models for Interpretable Subnetwork Adaptation

Apr 04, 2026
Via arXiv

EvoSkills: Self-Evolving Agent Skills via Co-Evolutionary Verification

Apr 02, 2026
Via arXiv

When Users Change Their Mind: Evaluating Interruptible Agents in Long-Horizon Web Navigation

Apr 01, 2026
Via arXiv

Locally Confident, Globally Stuck: The Quality-Exploration Dilemma in Diffusion Language Models

Apr 01, 2026
Via arXiv

EpochX: Building the Infrastructure for an Emergent Agent Civilization

Mar 28, 2026
Via arXiv

When Only the Final Text Survives: Implicit Execution Tracing for Multi-Agent Attribution

Mar 19, 2026
Via arXiv

Actor-Curator: Co-adaptive Curriculum Learning via Policy-Improvement Bandits for RL Post-Training

Feb 24, 2026
Via arXiv