Jayden Ooi

Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities

May 06, 2021
Ruohan Zhan, Konstantina Christakopoulou, Ya Le, Jayden Ooi, Martin Mladenov, Alex Beutel, Craig Boutilier, Ed H. Chi, Minmin Chen

Via arXiv

Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition

Aug 15, 2020
Henry Tsai, Jayden Ooi, Chun-Sung Ferng, Hyung Won Chung, Jason Riesa

Via arXiv

ConQUR: Mitigating Delusional Bias in Deep Q-learning

Feb 27, 2020
Andy Su, Jayden Ooi, Tyler Lu, Dale Schuurmans, Craig Boutilier

Via arXiv

Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing

Feb 12, 2020
Ge Liu, Rui Wu, Heng-Tze Cheng, Jing Wang, Jayden Ooi, Lihong Li, Ang Li, Wai Lok Sibon Li, Craig Boutilier, Ed Chi

Via arXiv

BRPO: Batch Residual Policy Optimization

Feb 08, 2020
Sungryull Sohn, Yinlam Chow, Jayden Ooi, Ofir Nachum, Honglak Lee, Ed Chi, Craig Boutilier

Via arXiv

Advantage Amplification in Slowly Evolving Latent-State Environments

May 29, 2019
Martin Mladenov, Ofer Meshi, Jayden Ooi, Dale Schuurmans, Craig Boutilier

Via arXiv