Yi Zhang

Carnegie Mellon University

Current World Models Lack a Persistent State Core

Jun 18, 2026

Hallucination Detection and Correction in Medical VLMs via Counter-Evidence Verification

Jun 17, 2026

A First-Principles Derivation of LLM Policy Optimization: From Expected Reward to GRPO and Its Structural Extensions

Jun 15, 2026

PhoneHarness: Harnessing Phone-Use Agents through Mixed GUI, CLI, and Tool Actions

Jun 12, 2026

Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling

Jun 10, 2026

ChinaHeritaQA: A Culturally-Grounded Visual Question Answering Dataset for World Heritage Sites in China

Jun 08, 2026

DriveReward: A Comprehensive Dataset and Generative Vision-Language Reward Model for Autonomous Driving

Jun 07, 2026

Adaptive Latent Agentic Reasoning

Jun 01, 2026

Forget Less, Generalize More: Unifying Temporal and Structural Adaptation for Dynamic Graphs

May 28, 2026

PhoneWorld: Scaling Phone-Use Agent Environments

May 28, 2026