Qi Meng

Machine-Learning Non-Conservative Dynamics for New-Physics Detection

Jun 02, 2021
Via arXiv

UniDrop: A Simple yet Effective Technique to Improve Transformer without Extra Cost

Apr 11, 2021
Via arXiv

Towards Accelerating Training of Batch Normalization: A Manifold Perspective

Jan 08, 2021
Via arXiv

The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural Networks

Dec 11, 2020
Via arXiv

Dynamic of Stochastic Gradient Descent with State-Dependent Noise

Jul 06, 2020
Via arXiv

Interpreting Basis Path Set in Neural Networks

Oct 18, 2019
Via arXiv

Reinforcement Learning with Dynamic Boltzmann Softmax Updates

Mar 15, 2019
Via arXiv

Positively Scale-Invariant Flatness of ReLU Neural Networks

Mar 06, 2019
Via arXiv

$\mathcal{G}$-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space

Oct 09, 2018
Via arXiv

Target Transfer Q-Learning and Its Convergence Analysis

Sep 21, 2018
Via arXiv