Tim Pearce

Reconciling Kaplan and Chinchilla Scaling Laws

Jun 12, 2024
Via arXiv

Diffusion for World Modeling: Visual Details Matter in Atari

May 20, 2024
Via arXiv

C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory

Feb 26, 2024
Via arXiv

Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach

Oct 26, 2023
Via arXiv

Coalitional Bargaining via Reinforcement Learning: An Application to Collaborative Vehicle Routing

Oct 26, 2023
Via arXiv

TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play

Feb 21, 2023
Via arXiv

Imitating Human Behaviour with Diffusion Models

Jan 25, 2023
Via arXiv

Censored Quantile Regression Neural Networks

May 26, 2022
Via arXiv

Bayesian Autoencoders: Analysing and Fixing the Bernoulli likelihood for Out-of-Distribution Detection

Jul 28, 2021
Via arXiv

Understanding Softmax Confidence and Uncertainty

Jun 09, 2021
Via arXiv