Amir-massoud Farahmand

Dissecting Deep RL with High Update Ratios: Combatting Value Overestimation and Divergence

Mar 09, 2024
Marcel Hussing, Claas Voelcker, Igor Gilitschenski, Amir-massoud Farahmand, Eric Eaton

Improving Adversarial Transferability via Model Alignment

Nov 30, 2023
Avery Ma, Amir-massoud Farahmand, Yangchen Pan, Philip Torr, Jindong Gu

Maximum Entropy Model Correction in Reinforcement Learning

Nov 29, 2023
Amin Rakhsha, Mete Kemertas, Mohammad Ghavamzadeh, Amir-massoud Farahmand

Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods

Aug 13, 2023
Avery Ma, Yangchen Pan, Amir-massoud Farahmand

Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate Gradients

Jul 17, 2023
Mete Kemertas, Allan D. Jepson, Amir-massoud Farahmand

Distributional Model Equivalence for Risk-Sensitive Reinforcement Learning

Jul 04, 2023
Tyler Kastner, Murat A. Erdogdu, Amir-massoud Farahmand

$λ$-AC: Learning latent decision-aware models for reinforcement learning in continuous state-spaces

Jun 30, 2023
Claas A Voelcker, Arash Ahmadian, Romina Abachi, Igor Gilitschenski, Amir-massoud Farahmand

Operator Splitting Value Iteration

Nov 25, 2022
Amin Rakhsha, Andrew Wang, Mohammad Ghavamzadeh, Amir-massoud Farahmand

Value Gradient weighted Model-Based Reinforcement Learning

Apr 04, 2022
Claas Voelcker, Victor Liao, Animesh Garg, Amir-massoud Farahmand

Deep Reinforcement Learning for Online Control of Stochastic Partial Differential Equations

Oct 23, 2021
Erfan Pirmorad, Faraz Khoshbakhtian, Farnam Mansouri, Amir-massoud Farahmand
