Alert button
Picture for Jose A. Arjona-Medina

Jose A. Arjona-Medina

Alert button

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution

Add code
Bookmark button
Alert button
Sep 29, 2020
Vihang P. Patil, Markus Hofmarcher, Marius-Constantin Dinu, Matthias Dorfer, Patrick M. Blies, Johannes Brandstetter, Jose A. Arjona-Medina, Sepp Hochreiter

Figure 1 for Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Figure 2 for Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Figure 3 for Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Figure 4 for Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Viaarxiv icon

Explaining and Interpreting LSTMs

Add code
Bookmark button
Alert button
Sep 25, 2019
Leila Arras, Jose A. Arjona-Medina, Michael Widrich, Grégoire Montavon, Michael Gillhofer, Klaus-Robert Müller, Sepp Hochreiter, Wojciech Samek

Figure 1 for Explaining and Interpreting LSTMs
Figure 2 for Explaining and Interpreting LSTMs
Figure 3 for Explaining and Interpreting LSTMs
Figure 4 for Explaining and Interpreting LSTMs
Viaarxiv icon

RUDDER: Return Decomposition for Delayed Rewards

Add code
Bookmark button
Alert button
Jun 20, 2018
Jose A. Arjona-Medina, Michael Gillhofer, Michael Widrich, Thomas Unterthiner, Sepp Hochreiter

Figure 1 for RUDDER: Return Decomposition for Delayed Rewards
Figure 2 for RUDDER: Return Decomposition for Delayed Rewards
Figure 3 for RUDDER: Return Decomposition for Delayed Rewards
Figure 4 for RUDDER: Return Decomposition for Delayed Rewards
Viaarxiv icon