Zhuoran Yang

Quantile-Optimal Policy Learning under Unmeasured Confounding

Jun 08, 2025
Via arXiv

BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms

May 21, 2025
Via arXiv

Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving

Apr 17, 2025
Via arXiv

In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention

Mar 17, 2025
Via arXiv

Nash Equilibrium Constrained Auto-bidding With Bi-level Reinforcement Learning

Mar 13, 2025
Via arXiv

Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation

Feb 23, 2025
Via arXiv

DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization

Feb 11, 2025
Via arXiv

Active Advantage-Aligned Online Reinforcement Learning with Offline Data

Feb 11, 2025
Via arXiv

Learning Task Representations from In-Context Learning

Feb 08, 2025
Via arXiv

An Instrumental Value for Data Production and its Application to Data Pricing

Dec 24, 2024
Via arXiv