Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Recurrent Neural Networks in the Eye of Differential Equations

Apr 29, 2019

Murphy Yuezhen Niu, Lior Horesh, Isaac Chuang

Figure 1 for Recurrent Neural Networks in the Eye of Differential Equations

Figure 2 for Recurrent Neural Networks in the Eye of Differential Equations

Figure 3 for Recurrent Neural Networks in the Eye of Differential Equations

Figure 4 for Recurrent Neural Networks in the Eye of Differential Equations

Share this with someone who'll enjoy it:

Abstract:To understand the fundamental trade-offs between training stability, temporal dynamics and architectural complexity of recurrent neural networks~(RNNs), we directly analyze RNN architectures using numerical methods of ordinary differential equations~(ODEs). We define a general family of RNNs--the ODERNNs--by relating the composition rules of RNNs to integration methods of ODEs at discrete time steps. We show that the degree of RNN's functional nonlinearity $n$ and the range of its temporal memory $t$ can be mapped to the corresponding stage of Runge-Kutta recursion and the order of time-derivative of the ODEs. We prove that popular RNN architectures, such as LSTM and URNN, fit into different orders of $n$-$t$-ODERNNs. This exact correspondence between RNN and ODE helps us to establish the sufficient conditions for RNN training stability and facilitates more flexible top-down designs of new RNN architectures using large varieties of toolboxes from numerical integration of ODEs. We provide such an example: Quantum-inspired Universal computing Neural Network~(QUNN), which reduces the required number of training parameters from polynomial in both data length and temporal memory length to only linear in temporal memory length.

* 25pages, 3 figures

View paper on

Share this with someone who'll enjoy it:

Title:Recurrent Neural Networks in the Eye of Differential Equations

Paper and Code