Paavo Parmas

A unified view of likelihood ratio and reparameterization gradients

May 31, 2021
Paavo Parmas, Masashi Sugiyama

A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme

Oct 14, 2019
Paavo Parmas, Masashi Sugiyama

Total stochastic gradient algorithms and applications in reinforcement learning

Feb 05, 2019
Paavo Parmas

PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos

Feb 04, 2019
Paavo Parmas, Carl Edward Rasmussen, Jan Peters, Kenji Doya
