Mohammad Ghavamzadeh

INRIA Lille - Nord Europe

A Review of Deep Learning for Video Captioning

Apr 22, 2023
Via arXiv

Aligning Text-to-Image Models using Human Feedback

Feb 23, 2023
Via arXiv

Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management

Feb 21, 2023
Via arXiv

Multi-Task Off-Policy Learning from Bandit Feedback

Dec 09, 2022
Via arXiv

Operator Splitting Value Iteration

Nov 25, 2022
Via arXiv

RASR: Risk-Averse Soft-Robust MDPs with EVaR and Entropic Risk

Sep 14, 2022
Via arXiv

Robust Reinforcement Learning using Offline Data

Aug 10, 2022
Via arXiv

Reinforcement Learning of Multi-Domain Dialog Policies Via Action Embeddings

Jul 01, 2022
Via arXiv

A Mixture-of-Expert Approach to RL-based Dialogue Management

May 31, 2022
Via arXiv

Collaborative Multi-agent Stochastic Linear Bandits

May 12, 2022
Via arXiv