Qi Meng

UniDrop: A Simple yet Effective Technique to Improve Transformer without Extra Cost

Apr 11, 2021
Zhen Wu, Lijun Wu, Qi Meng, Yingce Xia, Shufang Xie, Tao Qin, Xinyu Dai, Tie-Yan Liu

Towards Accelerating Training of Batch Normalization: A Manifold Perspective

Jan 08, 2021
Mingyang Yi, Qi Meng, Wei Chen, Zhi-Ming Ma

The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural Networks

Dec 11, 2020
Bohan Wang, Qi Meng, Wei Chen

Dynamic of Stochastic Gradient Descent with State-Dependent Noise

Jul 06, 2020
Qi Meng, Shiqi Gong, Wei Chen, Zhi-Ming Ma, Tie-Yan Liu

Interpreting Basis Path Set in Neural Networks

Oct 18, 2019
Juanping Zhu, Qi Meng, Wei Chen, Zhi-Ming Ma

Reinforcement Learning with Dynamic Boltzmann Softmax Updates

Mar 15, 2019
Ling Pan, Qingpeng Cai, Qi Meng, Wei Chen, Longbo Huang, Tie-Yan Liu

Positively Scale-Invariant Flatness of ReLU Neural Networks

Mar 06, 2019
Mingyang Yi, Qi Meng, Wei Chen, Zhi-Ming Ma, Tie-Yan Liu

$\mathcal{G}$-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space

Oct 09, 2018
Qi Meng, Wei Chen, Shuxin Zheng, Huishuai Zhang, Qiwei Ye, Zhi-Ming Ma, Tie-Yan Liu

Target Transfer Q-Learning and Its Convergence Analysis

Sep 21, 2018
Yue Wang, Qi Meng, Wei Cheng, Yuting Liu, Zhi-Ming Ma, Tie-Yan Liu
