Lin F. Yang

Sample Complexity Bounds for Linear Constrained MDPs with a Generative Model

Jul 02, 2025

Does Feedback Help in Bandits with Arm Erasures?

Apr 29, 2025

NoWag: A Unified Framework for Shape Preserving Compression of Large Language Models

Apr 20, 2025

Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning

Dec 04, 2024

Misspecified $Q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error

Jul 18, 2024

Learning for Bandits under Action Erasures

Jun 26, 2024

Confident Natural Policy Gradient for Local Planning in $q_\pi$-realizable Constrained MDPs

Jun 26, 2024

Don't Forget to Connect! Improving RAG with Graph-based Reranking

May 28, 2024

Multi-Agent Bandit Learning through Heterogeneous Action Erasure Channels

Dec 21, 2023

Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation

Dec 07, 2023