Alert button
Picture for Baihe Huang

Baihe Huang

Alert button

Data Acquisition via Experimental Design for Decentralized Data Markets

Mar 20, 2024
Charles Lu, Baihe Huang, Sai Praneeth Karimireddy, Praneeth Vepakomma, Michael Jordan, Ramesh Raskar

Viaarxiv icon

Towards Optimal Statistical Watermarking

Dec 13, 2023
Baihe Huang, Banghua Zhu, Hanlin Zhu, Jason D. Lee, Jiantao Jiao, Michael I. Jordan

Viaarxiv icon

On Representation Complexity of Model-based and Model-free Reinforcement Learning

Oct 03, 2023
Hanlin Zhu, Baihe Huang, Stuart Russell

Figure 1 for On Representation Complexity of Model-based and Model-free Reinforcement Learning
Figure 2 for On Representation Complexity of Model-based and Model-free Reinforcement Learning
Figure 3 for On Representation Complexity of Model-based and Model-free Reinforcement Learning
Figure 4 for On Representation Complexity of Model-based and Model-free Reinforcement Learning
Viaarxiv icon

Sample Complexity for Quadratic Bandits: Hessian Dependent Bounds and Optimal Algorithms

Jun 22, 2023
Qian Yu, Yining Wang, Baihe Huang, Qi Lei, Jason D. Lee

Viaarxiv icon

Evaluating and Incentivizing Diverse Data Contributions in Collaborative Learning

Jun 08, 2023
Baihe Huang, Sai Praneeth Karimireddy, Michael I. Jordan

Figure 1 for Evaluating and Incentivizing Diverse Data Contributions in Collaborative Learning
Figure 2 for Evaluating and Incentivizing Diverse Data Contributions in Collaborative Learning
Figure 3 for Evaluating and Incentivizing Diverse Data Contributions in Collaborative Learning
Figure 4 for Evaluating and Incentivizing Diverse Data Contributions in Collaborative Learning
Viaarxiv icon

Offline Reinforcement Learning with Realizability and Single-policy Concentrability

Feb 11, 2022
Wenhao Zhan, Baihe Huang, Audrey Huang, Nan Jiang, Jason D. Lee

Figure 1 for Offline Reinforcement Learning with Realizability and Single-policy Concentrability
Figure 2 for Offline Reinforcement Learning with Realizability and Single-policy Concentrability
Viaarxiv icon

Towards General Function Approximation in Zero-Sum Markov Games

Jul 30, 2021
Baihe Huang, Jason D. Lee, Zhaoran Wang, Zhuoran Yang

Viaarxiv icon

Going Beyond Linear RL: Sample Efficient Neural Function Approximation

Jul 14, 2021
Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang

Figure 1 for Going Beyond Linear RL: Sample Efficient Neural Function Approximation
Figure 2 for Going Beyond Linear RL: Sample Efficient Neural Function Approximation
Viaarxiv icon

Optimal Gradient-based Algorithms for Non-concave Bandit Optimization

Jul 09, 2021
Baihe Huang, Kaixuan Huang, Sham M. Kakade, Jason D. Lee, Qi Lei, Runzhe Wang, Jiaqi Yang

Figure 1 for Optimal Gradient-based Algorithms for Non-concave Bandit Optimization
Figure 2 for Optimal Gradient-based Algorithms for Non-concave Bandit Optimization
Viaarxiv icon