Picture for Yi Zhang

Yi Zhang

Carnegie Mellon University

ChinaHeritaQA: A Culturally-Grounded Visual Question Answering Dataset for World Heritage Sites in China

Add code
Jun 08, 2026
Viaarxiv icon

DriveReward: A Comprehensive Dataset and Generative Vision-Language Reward Model for Autonomous Driving

Add code
Jun 07, 2026
Viaarxiv icon

Adaptive Latent Agentic Reasoning

Add code
Jun 01, 2026
Viaarxiv icon

VLA-Trace: Diagnosing Vision-Language-Action Models through Representation and Behavior Tracing

Add code
May 28, 2026
Viaarxiv icon

Forget Less, Generalize More: Unifying Temporal and Structural Adaptation for Dynamic Graphs

Add code
May 28, 2026
Viaarxiv icon

AnomalyAgent: Training-Free Agentic Models for Zero-/Few-Shot Anomaly Detection

Add code
May 28, 2026
Viaarxiv icon

Rec-Distill: An Industrial Distillation Pipeline for Large-Scale Recommendation Models

Add code
May 28, 2026
Viaarxiv icon

GUITestScape: Towards Open-set Evaluation on Exploratory GUI Testing

Add code
May 28, 2026
Viaarxiv icon

PhoneWorld: Scaling Phone-Use Agent Environments

Add code
May 28, 2026
Viaarxiv icon

IP-Adapter Is All You Need: Towards Fine-Tuning-Free Diffusion-Based Talking Face Generation

Add code
May 28, 2026
Viaarxiv icon