Alert button
Picture for Mengdi Wang

Mengdi Wang

Alert button

Communication Efficient Distributed Learning for Kernelized Contextual Bandits

Add code
Bookmark button
Alert button
Jun 10, 2022
Chuanhao Li, Huazheng Wang, Mengdi Wang, Hongning Wang

Figure 1 for Communication Efficient Distributed Learning for Kernelized Contextual Bandits
Figure 2 for Communication Efficient Distributed Learning for Kernelized Contextual Bandits
Figure 3 for Communication Efficient Distributed Learning for Kernelized Contextual Bandits
Viaarxiv icon

Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks

Add code
Bookmark button
Alert button
Jun 06, 2022
Xiang Ji, Minshuo Chen, Mengdi Wang, Tuo Zhao

Figure 1 for Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks
Figure 2 for Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks
Figure 3 for Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks
Figure 4 for Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks
Viaarxiv icon

Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization

Add code
Bookmark button
Alert button
Jun 05, 2022
Hui Yuan, Chengzhuo Ni, Huazheng Wang, Xuezhou Zhang, Le Cong, Csaba Szepesvári, Mengdi Wang

Figure 1 for Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization
Figure 2 for Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization
Figure 3 for Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization
Figure 4 for Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization
Viaarxiv icon

Byzantine-Robust Online and Offline Distributed Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 01, 2022
Yiding Chen, Xuezhou Zhang, Kaiqing Zhang, Mengdi Wang, Xiaojin Zhu

Viaarxiv icon

Provable Benefits of Representational Transfer in Reinforcement Learning

Add code
Bookmark button
Alert button
May 29, 2022
Alekh Agarwal, Yuda Song, Wen Sun, Kaiwen Wang, Mengdi Wang, Xuezhou Zhang

Figure 1 for Provable Benefits of Representational Transfer in Reinforcement Learning
Figure 2 for Provable Benefits of Representational Transfer in Reinforcement Learning
Figure 3 for Provable Benefits of Representational Transfer in Reinforcement Learning
Figure 4 for Provable Benefits of Representational Transfer in Reinforcement Learning
Viaarxiv icon

Parameter-Efficient Sparsity for Large Language Models Fine-Tuning

Add code
Bookmark button
Alert button
May 23, 2022
Yuchao Li, Fuli Luo, Chuanqi Tan, Mengdi Wang, Songfang Huang, Shen Li, Junjie Bai

Figure 1 for Parameter-Efficient Sparsity for Large Language Models Fine-Tuning
Figure 2 for Parameter-Efficient Sparsity for Large Language Models Fine-Tuning
Figure 3 for Parameter-Efficient Sparsity for Large Language Models Fine-Tuning
Figure 4 for Parameter-Efficient Sparsity for Large Language Models Fine-Tuning
Viaarxiv icon

Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism

Add code
Bookmark button
Alert button
Mar 11, 2022
Ming Yin, Yaqi Duan, Mengdi Wang, Yu-Xiang Wang

Figure 1 for Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism
Viaarxiv icon

Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory

Add code
Bookmark button
Alert button
Feb 10, 2022
Ruiqi Zhang, Xuezhou Zhang, Chengzhuo Ni, Mengdi Wang

Figure 1 for Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Viaarxiv icon

Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach

Add code
Bookmark button
Alert button
Feb 02, 2022
Xuezhou Zhang, Yuda Song, Masatoshi Uehara, Mengdi Wang, Alekh Agarwal, Wen Sun

Figure 1 for Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Figure 2 for Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Figure 3 for Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Figure 4 for Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Viaarxiv icon

Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration

Add code
Bookmark button
Alert button
Jan 31, 2022
Chengzhuo Ni, Ruiqi Zhang, Xiang Ji, Xuezhou Zhang, Mengdi Wang

Figure 1 for Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration
Figure 2 for Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration
Figure 3 for Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration
Figure 4 for Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration
Viaarxiv icon