Picture for Baihe Huang

Baihe Huang

Stochastic Zeroth-Order Optimization under Strongly Convexity and Lipschitz Hessian: Minimax Sample Complexity

Add code
Jun 28, 2024
Viaarxiv icon

Data Acquisition via Experimental Design for Decentralized Data Markets

Add code
Mar 20, 2024
Figure 1 for Data Acquisition via Experimental Design for Decentralized Data Markets
Figure 2 for Data Acquisition via Experimental Design for Decentralized Data Markets
Figure 3 for Data Acquisition via Experimental Design for Decentralized Data Markets
Figure 4 for Data Acquisition via Experimental Design for Decentralized Data Markets
Viaarxiv icon

Towards Optimal Statistical Watermarking

Add code
Dec 13, 2023
Viaarxiv icon

On Representation Complexity of Model-based and Model-free Reinforcement Learning

Add code
Oct 03, 2023
Figure 1 for On Representation Complexity of Model-based and Model-free Reinforcement Learning
Figure 2 for On Representation Complexity of Model-based and Model-free Reinforcement Learning
Figure 3 for On Representation Complexity of Model-based and Model-free Reinforcement Learning
Figure 4 for On Representation Complexity of Model-based and Model-free Reinforcement Learning
Viaarxiv icon

Sample Complexity for Quadratic Bandits: Hessian Dependent Bounds and Optimal Algorithms

Add code
Jun 22, 2023
Viaarxiv icon

Evaluating and Incentivizing Diverse Data Contributions in Collaborative Learning

Add code
Jun 08, 2023
Figure 1 for Evaluating and Incentivizing Diverse Data Contributions in Collaborative Learning
Figure 2 for Evaluating and Incentivizing Diverse Data Contributions in Collaborative Learning
Figure 3 for Evaluating and Incentivizing Diverse Data Contributions in Collaborative Learning
Figure 4 for Evaluating and Incentivizing Diverse Data Contributions in Collaborative Learning
Viaarxiv icon

Offline Reinforcement Learning with Realizability and Single-policy Concentrability

Add code
Feb 11, 2022
Figure 1 for Offline Reinforcement Learning with Realizability and Single-policy Concentrability
Figure 2 for Offline Reinforcement Learning with Realizability and Single-policy Concentrability
Viaarxiv icon

Towards General Function Approximation in Zero-Sum Markov Games

Add code
Jul 30, 2021
Viaarxiv icon

Going Beyond Linear RL: Sample Efficient Neural Function Approximation

Add code
Jul 14, 2021
Figure 1 for Going Beyond Linear RL: Sample Efficient Neural Function Approximation
Figure 2 for Going Beyond Linear RL: Sample Efficient Neural Function Approximation
Viaarxiv icon

Optimal Gradient-based Algorithms for Non-concave Bandit Optimization

Add code
Jul 09, 2021
Figure 1 for Optimal Gradient-based Algorithms for Non-concave Bandit Optimization
Figure 2 for Optimal Gradient-based Algorithms for Non-concave Bandit Optimization
Viaarxiv icon