Picture for Yali Du

Yali Du

Calibrate-Then-Delegate: Safety Monitoring with Risk and Budget Guarantees via Model Cascades

Add code
Apr 15, 2026
Viaarxiv icon

ACES: Who Tests the Tests? Leave-One-Out AUC Consistency for Code Generation

Add code
Apr 05, 2026
Viaarxiv icon

Design-Specification Tiling for ICL-based CAD Code Generation

Add code
Mar 13, 2026
Viaarxiv icon

VAM: Verbalized Action Masking for Controllable Exploration in RL Post-Training -- A Chess Case Study

Add code
Feb 18, 2026
Viaarxiv icon

Hunt Instead of Wait: Evaluating Deep Data Research on Large Language Models

Add code
Feb 02, 2026
Viaarxiv icon

Is Pure Exploitation Sufficient in Exogenous MDPs with Linear Function Approximation?

Add code
Jan 28, 2026
Viaarxiv icon

Social World Model-Augmented Mechanism Design Policy Learning

Add code
Oct 22, 2025
Viaarxiv icon

Intrinsic Memory Agents: Heterogeneous Multi-Agent LLM Systems through Structured Contextual Memory

Add code
Aug 12, 2025
Viaarxiv icon

NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning

Add code
May 21, 2025
Viaarxiv icon

Automatic Dataset Generation for Knowledge Intensive Question Answering Tasks

Add code
May 20, 2025
Viaarxiv icon