Picture for Heyan Huang

Heyan Huang

Learning from Own Solutions: Self-Conditioned Credit Assignment for Reinforcement Learning with Verifiable Rewards

Add code
Jun 17, 2026
Viaarxiv icon

PathRouter: Aligning Rewards with Retrieval Quality in Agentic Graph Retrieval-Augmented Generation

Add code
Jun 15, 2026
Viaarxiv icon

AdaPLD: Adaptive Retrieval and Reuse for Efficient Model-Free Speculative Decoding

Add code
Jun 04, 2026
Viaarxiv icon

Benchmarking Living-Screen-Native GUI Agents on Short-Video Platforms

Add code
Jun 03, 2026
Viaarxiv icon

MTAVG-Bench 2.0: Diagnosing Failure Modes of Cinematic Expressiveness in Multi-Talker Audio-Video Generation

Add code
May 27, 2026
Viaarxiv icon

Zero-Shot Detection of LLM-Generated Text via Implicit Reward Model

Add code
Apr 23, 2026
Viaarxiv icon

MASS-RAG: Multi-Agent Synthesis Retrieval-Augmented Generation

Add code
Apr 21, 2026
Viaarxiv icon

EmergentBridge: Improving Zero-Shot Cross-Modal Transfer in Unified Multimodal Embedding Models

Add code
Apr 14, 2026
Viaarxiv icon

Policy Split: Incentivizing Dual-Mode Exploration in LLM Reinforcement with Dual-Mode Entropy Regularization

Add code
Apr 13, 2026
Viaarxiv icon

Utilizing and Calibrating Hindsight Process Rewards via Reinforcement with Mutual Information Self-Evaluation

Add code
Apr 13, 2026
Viaarxiv icon