Aivar Sootla

Structured Q-learning For Antibody Design

Sep 13, 2022
Alexander I. Cowen-Rivers, Philip John Gorinski, Aivar Sootla, Asif Khan, Liu Furui, Jun Wang, Jan Peters, Haitham Bou Ammar

Enhancing Safe Exploration Using Safety State Augmentation

Jun 06, 2022
Aivar Sootla, Alexander I. Cowen-Rivers, Jun Wang, Haitham Bou Ammar

Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints

Jun 06, 2022
David Mguni, Aivar Sootla, Juliusz Ziomek, Oliver Slumbers, Zipeng Dai, Kun Shao, Jun Wang

SEREN: Knowing When to Explore and When to Exploit

May 30, 2022
Changmin Yu, David Mguni, Dong Li, Aivar Sootla, Jun Wang, Neil Burgess

SAUTE RL: Almost Surely Safe Reinforcement Learning Using State Augmentation

Feb 16, 2022
Aivar Sootla, Alexander I. Cowen-Rivers, Taher Jafferjee, Ziyan Wang, David Mguni, Jun Wang, Haitham Bou-Ammar

Reinforcement Learning in Presence of Discrete Markovian Context Evolution

Feb 14, 2022
Hang Ren, Aivar Sootla, Taher Jafferjee, Junxiao Shen, Jun Wang, Haitham Bou-Ammar

DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention

Oct 27, 2021
David Mguni, Joel Jennings, Taher Jafferjee, Aivar Sootla, Yaodong Yang, Changmin Yu, Usman Islam, Ziyan Wang, Jun Wang

Implicit Variational Conditional Sampling with Normalizing Flows

Jul 06, 2021
Vincent Moens, Aivar Sootla, Haitham Bou Ammar, Jun Wang
