Picture for Zhichao Yang

Zhichao Yang

Speculative Rollback Correction for Quality-Diverse Web Agent Imitation

Add code
Jun 10, 2026
Viaarxiv icon

PACT: Learning Diverse Diagnostic Strategies via Privileged Synthesis and Branch Consensus

Add code
Jun 08, 2026
Viaarxiv icon

StainFlow: Entity-Stain Tracking and Evidence Linking for Process Rewards in GUI Agents

Add code
Jun 05, 2026
Viaarxiv icon

MIRAGE: Mobile Agents with Implicit Reasoning and Generative World Models

Add code
Jun 03, 2026
Viaarxiv icon

MedFabric and EtHER: A Data-Centric Framework for Word-Level Fabrication Generation and Detection in Medical LLMs

Add code
May 05, 2026
Viaarxiv icon

Fine-grained Image Aesthetic Assessment: Learning Discriminative Scores from Relative Ranks

Add code
Mar 04, 2026
Viaarxiv icon

TARSE: Test-Time Adaptation via Retrieval of Skills and Experience for Reasoning Agents

Add code
Mar 01, 2026
Viaarxiv icon

Fast and Effective On-policy Distillation from Reasoning Prefixes

Add code
Feb 16, 2026
Viaarxiv icon

ESTAR: Early-Stopping Token-Aware Reasoning For Efficient Inference

Add code
Feb 10, 2026
Viaarxiv icon

Health-SCORE: Towards Scalable Rubrics for Improving Health-LLMs

Add code
Jan 26, 2026
Viaarxiv icon