Picture for Quanquan Gu

Quanquan Gu

Beyond One-Model-Fits-All: A Survey of Domain Specialization for Large Language Models

Add code
May 31, 2023
Figure 1 for Beyond One-Model-Fits-All: A Survey of Domain Specialization for Large Language Models
Figure 2 for Beyond One-Model-Fits-All: A Survey of Domain Specialization for Large Language Models
Figure 3 for Beyond One-Model-Fits-All: A Survey of Domain Specialization for Large Language Models
Figure 4 for Beyond One-Model-Fits-All: A Survey of Domain Specialization for Large Language Models
Viaarxiv icon

Uniform-PAC Guarantees for Model-Based RL with Bounded Eluder Dimension

Add code
May 15, 2023
Viaarxiv icon

Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs

Add code
May 15, 2023
Viaarxiv icon

Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation

Add code
May 12, 2023
Viaarxiv icon

Personalized Federated Learning under Mixture of Distributions

Add code
May 01, 2023
Figure 1 for Personalized Federated Learning under Mixture of Distributions
Figure 2 for Personalized Federated Learning under Mixture of Distributions
Figure 3 for Personalized Federated Learning under Mixture of Distributions
Figure 4 for Personalized Federated Learning under Mixture of Distributions
Viaarxiv icon

Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs

Add code
Mar 17, 2023
Viaarxiv icon

On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits

Add code
Mar 16, 2023
Figure 1 for On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits
Figure 2 for On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits
Figure 3 for On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits
Figure 4 for On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits
Viaarxiv icon

Borda Regret Minimization for Generalized Linear Dueling Bandits

Add code
Mar 15, 2023
Viaarxiv icon

The Benefits of Mixup for Feature Learning

Add code
Mar 15, 2023
Viaarxiv icon

Benign Overfitting for Two-layer ReLU Networks

Add code
Mar 07, 2023
Viaarxiv icon