Lin F. Yang

Best-Arm Identification with Noisy Actuation

Apr 02, 2026
Via arXiv

Near-Optimal Sample Complexity for Online Constrained MDPs

Feb 16, 2026
Via arXiv

LACONIC: Length-Aware Constrained Reinforcement Learning for LLM

Feb 16, 2026
Via arXiv

Sample Complexity Bounds for Linear Constrained MDPs with a Generative Model

Jul 02, 2025
Via arXiv

Does Feedback Help in Bandits with Arm Erasures?

Apr 29, 2025
Via arXiv

NoWag: A Unified Framework for Shape Preserving Compression of Large Language Models

Apr 20, 2025
Via arXiv

Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning

Dec 04, 2024
Via arXiv

Misspecified $Q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error

Jul 18, 2024
Via arXiv

Learning for Bandits under Action Erasures

Jun 26, 2024
Via arXiv

Confident Natural Policy Gradient for Local Planning in $q_\pi$-realizable Constrained MDPs

Jun 26, 2024
Via arXiv