Picture for Brendan Maginnis

Brendan Maginnis

A short variational proof of equivalence between policy gradients and soft Q learning

Add code
Dec 22, 2017
Viaarxiv icon

On Wasserstein Reinforcement Learning and the Fokker-Planck equation

Add code
Dec 19, 2017
Viaarxiv icon

Efficiently applying attention to sequential data with the Recurrent Discounted Attention unit

Add code
Jun 19, 2017
Figure 1 for Efficiently applying attention to sequential data with the Recurrent Discounted Attention unit
Figure 2 for Efficiently applying attention to sequential data with the Recurrent Discounted Attention unit
Figure 3 for Efficiently applying attention to sequential data with the Recurrent Discounted Attention unit
Figure 4 for Efficiently applying attention to sequential data with the Recurrent Discounted Attention unit
Viaarxiv icon