Picture for Jayden Ooi

Jayden Ooi

Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities

Add code
May 06, 2021
Figure 1 for Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities
Figure 2 for Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities
Figure 3 for Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities
Figure 4 for Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities
Viaarxiv icon

Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition

Add code
Aug 15, 2020
Figure 1 for Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition
Figure 2 for Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition
Figure 3 for Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition
Figure 4 for Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition
Viaarxiv icon

ConQUR: Mitigating Delusional Bias in Deep Q-learning

Add code
Feb 27, 2020
Figure 1 for ConQUR: Mitigating Delusional Bias in Deep Q-learning
Figure 2 for ConQUR: Mitigating Delusional Bias in Deep Q-learning
Figure 3 for ConQUR: Mitigating Delusional Bias in Deep Q-learning
Figure 4 for ConQUR: Mitigating Delusional Bias in Deep Q-learning
Viaarxiv icon

Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing

Add code
Feb 12, 2020
Figure 1 for Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing
Figure 2 for Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing
Figure 3 for Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing
Figure 4 for Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing
Viaarxiv icon

BRPO: Batch Residual Policy Optimization

Add code
Feb 08, 2020
Figure 1 for BRPO: Batch Residual Policy Optimization
Figure 2 for BRPO: Batch Residual Policy Optimization
Figure 3 for BRPO: Batch Residual Policy Optimization
Figure 4 for BRPO: Batch Residual Policy Optimization
Viaarxiv icon

Advantage Amplification in Slowly Evolving Latent-State Environments

Add code
May 29, 2019
Figure 1 for Advantage Amplification in Slowly Evolving Latent-State Environments
Figure 2 for Advantage Amplification in Slowly Evolving Latent-State Environments
Figure 3 for Advantage Amplification in Slowly Evolving Latent-State Environments
Viaarxiv icon