Hang Yan

Dual-Dimensional Consistency: Balancing Budget and Quality in Adaptive Inference-Time Scaling

May 14, 2026
Via arXiv

Agentic Harness Engineering: Observability-Driven Automatic Evolution of Coding-Agent Harnesses

Apr 28, 2026
Via arXiv

OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis

Apr 16, 2026
Via arXiv

MM-Doc-R1: Training Agents for Long Document Visual Question Answering through Multi-turn Reinforcement Learning

Apr 15, 2026
Via arXiv

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Feb 05, 2026
Via arXiv

Steering LLMs via Scalable Interactive Oversight

Feb 04, 2026
Via arXiv

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Feb 03, 2026
Via arXiv

Forget What's Sensitive, Remember What Matters: Token-Level Differential Privacy in Memory Sculpting for Continual Learning

Sep 16, 2025
Via arXiv

Black-box Model Merging for Language-Model-as-a-Service with Massive Model Repositories

Sep 16, 2025
Via arXiv

Building Self-Evolving Agents via Experience-Driven Lifelong Learning: A Framework and Benchmark

Aug 26, 2025
Via arXiv