Alert button
Picture for Haitham Bou-Ammar

Haitham Bou-Ammar

Alert button

Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications

Add code
Bookmark button
Alert button
Apr 13, 2024
Puze Liu, Haitham Bou-Ammar, Jan Peters, Davide Tateo

Viaarxiv icon

ZSL-RPPO: Zero-Shot Learning for Quadrupedal Locomotion in Challenging Terrains using Recurrent Proximal Policy Optimization

Add code
Bookmark button
Alert button
Mar 04, 2024
Yao Zhao, Tao Wu, Yijie Zhu, Xiang Lu, Jun Wang, Haitham Bou-Ammar, Xinyu Zhang, Peng Du

Figure 1 for ZSL-RPPO: Zero-Shot Learning for Quadrupedal Locomotion in Challenging Terrains using Recurrent Proximal Policy Optimization
Figure 2 for ZSL-RPPO: Zero-Shot Learning for Quadrupedal Locomotion in Challenging Terrains using Recurrent Proximal Policy Optimization
Figure 3 for ZSL-RPPO: Zero-Shot Learning for Quadrupedal Locomotion in Challenging Terrains using Recurrent Proximal Policy Optimization
Figure 4 for ZSL-RPPO: Zero-Shot Learning for Quadrupedal Locomotion in Challenging Terrains using Recurrent Proximal Policy Optimization
Viaarxiv icon

Bayesian Reward Models for LLM Alignment

Add code
Bookmark button
Alert button
Feb 20, 2024
Adam X. Yang, Maxime Robeyns, Thomas Coste, Jun Wang, Haitham Bou-Ammar, Laurence Aitchison

Viaarxiv icon

Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning

Add code
Bookmark button
Alert button
Dec 22, 2023
Filippos Christianos, Georgios Papoudakis, Matthieu Zimmer, Thomas Coste, Zhihao Wu, Jingxuan Chen, Khyati Khandelwal, James Doran, Xidong Feng, Jiacheng Liu, Zheng Xiong, Yicheng Luo, Jianye Hao, Kun Shao, Haitham Bou-Ammar, Jun Wang

Viaarxiv icon

Why Can Large Language Models Generate Correct Chain-of-Thoughts?

Add code
Bookmark button
Alert button
Oct 30, 2023
Rasul Tutunov, Antoine Grosnit, Juliusz Ziomek, Jun Wang, Haitham Bou-Ammar

Viaarxiv icon

Are Random Decompositions all we need in High Dimensional Bayesian Optimisation?

Add code
Bookmark button
Alert button
Jan 30, 2023
Juliusz Ziomek, Haitham Bou-Ammar

Figure 1 for Are Random Decompositions all we need in High Dimensional Bayesian Optimisation?
Figure 2 for Are Random Decompositions all we need in High Dimensional Bayesian Optimisation?
Figure 3 for Are Random Decompositions all we need in High Dimensional Bayesian Optimisation?
Figure 4 for Are Random Decompositions all we need in High Dimensional Bayesian Optimisation?
Viaarxiv icon

Contextual Causal Bayesian Optimisation

Add code
Bookmark button
Alert button
Jan 29, 2023
Vahan Arsenyan, Antoine Grosnit, Haitham Bou-Ammar

Figure 1 for Contextual Causal Bayesian Optimisation
Figure 2 for Contextual Causal Bayesian Optimisation
Figure 3 for Contextual Causal Bayesian Optimisation
Figure 4 for Contextual Causal Bayesian Optimisation
Viaarxiv icon

Fast Kinodynamic Planning on the Constraint Manifold with Deep Neural Networks

Add code
Bookmark button
Alert button
Jan 12, 2023
Piotr Kicki, Puze Liu, Davide Tateo, Haitham Bou-Ammar, Krzysztof Walas, Piotr Skrzypczyński, Jan Peters

Figure 1 for Fast Kinodynamic Planning on the Constraint Manifold with Deep Neural Networks
Figure 2 for Fast Kinodynamic Planning on the Constraint Manifold with Deep Neural Networks
Figure 3 for Fast Kinodynamic Planning on the Constraint Manifold with Deep Neural Networks
Figure 4 for Fast Kinodynamic Planning on the Constraint Manifold with Deep Neural Networks
Viaarxiv icon

AntBO: Towards Real-World Automated Antibody Design with Combinatorial Bayesian Optimisation

Add code
Bookmark button
Alert button
Feb 16, 2022
Asif Khan, Alexander I. Cowen-Rivers, Derrick-Goh-Xin Deik, Antoine Grosnit, Kamil Dreczkowski, Philippe A. Robert, Victor Greiff, Rasul Tutunov, Dany Bou-Ammar, Jun Wang, Haitham Bou-Ammar

Figure 1 for AntBO: Towards Real-World Automated Antibody Design with Combinatorial Bayesian Optimisation
Figure 2 for AntBO: Towards Real-World Automated Antibody Design with Combinatorial Bayesian Optimisation
Figure 3 for AntBO: Towards Real-World Automated Antibody Design with Combinatorial Bayesian Optimisation
Figure 4 for AntBO: Towards Real-World Automated Antibody Design with Combinatorial Bayesian Optimisation
Viaarxiv icon