Picture for Yinlam Chow

Yinlam Chow

Dima

A Mixture-of-Expert Approach to RL-based Dialogue Management

Add code
May 31, 2022
Figure 1 for A Mixture-of-Expert Approach to RL-based Dialogue Management
Figure 2 for A Mixture-of-Expert Approach to RL-based Dialogue Management
Figure 3 for A Mixture-of-Expert Approach to RL-based Dialogue Management
Figure 4 for A Mixture-of-Expert Approach to RL-based Dialogue Management
Viaarxiv icon

Efficient Risk-Averse Reinforcement Learning

Add code
May 10, 2022
Figure 1 for Efficient Risk-Averse Reinforcement Learning
Figure 2 for Efficient Risk-Averse Reinforcement Learning
Figure 3 for Efficient Risk-Averse Reinforcement Learning
Figure 4 for Efficient Risk-Averse Reinforcement Learning
Viaarxiv icon

SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition

Add code
Feb 10, 2022
Figure 1 for SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition
Figure 2 for SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition
Figure 3 for SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition
Figure 4 for SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition
Viaarxiv icon

Discovering Personalized Semantics for Soft Attributes in Recommender Systems using Concept Activation Vectors

Add code
Feb 06, 2022
Figure 1 for Discovering Personalized Semantics for Soft Attributes in Recommender Systems using Concept Activation Vectors
Figure 2 for Discovering Personalized Semantics for Soft Attributes in Recommender Systems using Concept Activation Vectors
Figure 3 for Discovering Personalized Semantics for Soft Attributes in Recommender Systems using Concept Activation Vectors
Figure 4 for Discovering Personalized Semantics for Soft Attributes in Recommender Systems using Concept Activation Vectors
Viaarxiv icon

Non-Stationary Latent Bandits

Add code
Dec 01, 2020
Figure 1 for Non-Stationary Latent Bandits
Figure 2 for Non-Stationary Latent Bandits
Figure 3 for Non-Stationary Latent Bandits
Viaarxiv icon

CoinDICE: Off-Policy Confidence Interval Estimation

Add code
Oct 22, 2020
Figure 1 for CoinDICE: Off-Policy Confidence Interval Estimation
Figure 2 for CoinDICE: Off-Policy Confidence Interval Estimation
Figure 3 for CoinDICE: Off-Policy Confidence Interval Estimation
Viaarxiv icon

Safe Reinforcement Learning with Natural Language Constraints

Add code
Oct 11, 2020
Figure 1 for Safe Reinforcement Learning with Natural Language Constraints
Figure 2 for Safe Reinforcement Learning with Natural Language Constraints
Figure 3 for Safe Reinforcement Learning with Natural Language Constraints
Figure 4 for Safe Reinforcement Learning with Natural Language Constraints
Viaarxiv icon

Variational Model-based Policy Optimization

Add code
Jun 24, 2020
Figure 1 for Variational Model-based Policy Optimization
Figure 2 for Variational Model-based Policy Optimization
Figure 3 for Variational Model-based Policy Optimization
Figure 4 for Variational Model-based Policy Optimization
Viaarxiv icon

Control-Aware Representations for Model-based Reinforcement Learning

Add code
Jun 24, 2020
Figure 1 for Control-Aware Representations for Model-based Reinforcement Learning
Figure 2 for Control-Aware Representations for Model-based Reinforcement Learning
Figure 3 for Control-Aware Representations for Model-based Reinforcement Learning
Figure 4 for Control-Aware Representations for Model-based Reinforcement Learning
Viaarxiv icon

Latent Bandits Revisited

Add code
Jun 15, 2020
Figure 1 for Latent Bandits Revisited
Figure 2 for Latent Bandits Revisited
Viaarxiv icon