Mengdi Wang

Optimal policy evaluation using kernel-based temporal difference methods

Sep 24, 2021
Yaqi Duan, Mengdi Wang, Martin J. Wainwright

Boosting the Convergence of Reinforcement Learning-based Auto-pruning Using Historical Data

Jul 16, 2021
Jiandong Mu, Mengdi Wang, Feiwen Zhu, Jun Yang, Wei Lin, Wei Zhang

On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control

Jun 15, 2021
Amrit Singh Bedi, Anjaly Parayil, Junyu Zhang, Mengdi Wang, Alec Koppel

1$\times$N Block Pattern for Network Sparsity

Jun 15, 2021
Mingbao Lin, Yuchao Li, Yuxin Zhang, Bohong Chen, Fei Chao, Mengdi Wang, Shen Li, Jun Yang, Rongrong Ji

You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Nature Gradient

Jun 04, 2021
Shaokun Zhang, Xiawu Zheng, Chenyi Yang, Yuchao Li, Yan Wang, Fei Chao, Mengdi Wang, Shen Li, Jun Yang, Rongrong Ji

MARL with General Utilities via Decentralized Shadow Reward Actor-Critic

May 29, 2021
Junyu Zhang, Amrit Singh Bedi, Mengdi Wang, Alec Koppel

Towards Compact CNNs via Collaborative Compression

May 24, 2021
Yuchao Li, Shaohui Lin, Jianzhuang Liu, Qixiang Ye, Mengdi Wang, Fei Chao, Fan Yang, Jincheng Ma, Qi Tian, Rongrong Ji

Learning Good State and Action Representations via Tensor Decomposition

May 03, 2021
Chengzhuo Ni, Anru Zhang, Yaqi Duan, Mengdi Wang

On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method

Feb 17, 2021
Junyu Zhang, Chengzhuo Ni, Zheng Yu, Csaba Szepesvari, Mengdi Wang
