Pascal Poupart

A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization

Mar 20, 2024
Yudong Luo, Yangchen Pan, Han Wang, Philip Torr, Pascal Poupart

Via arXiv

Why Online Reinforcement Learning is Causal

Mar 07, 2024
Oliver Schulte, Pascal Poupart

Via arXiv

A Sober Look at LLMs for Material Discovery: Are They Actually Good for Bayesian Optimization Over Molecules?

Feb 07, 2024
Agustinus Kristiadi, Felix Strieth-Kalthoff, Marta Skreta, Pascal Poupart, Alán Aspuru-Guzik, Geoff Pleiss

Via arXiv

Calibrated One Round Federated Learning with Bayesian Inference in the Predictive Space

Dec 15, 2023
Mohsin Hasan, Guojun Zhang, Kaiyang Guo, Xi Chen, Pascal Poupart

Via arXiv

Preventing Arbitrarily High Confidence on Far-Away Data in Point-Estimated Discriminative Neural Networks

Nov 07, 2023
Ahmad Rashid, Serena Hacker, Guojun Zhang, Agustinus Kristiadi, Pascal Poupart

Via arXiv

An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient

Aug 09, 2023
Yudong Luo, Guiliang Liu, Pascal Poupart, Yangchen Pan

Via arXiv

Attribute Controlled Dialogue Prompting

Jul 11, 2023
Runcheng Liu, Ahmad Rashid, Ivan Kobyzev, Mehdi Rezagholizadeh, Pascal Poupart

Via arXiv

Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization

Dec 12, 2022
Aref Jafari, Ivan Kobyzev, Mehdi Rezagholizadeh, Pascal Poupart, Ali Ghodsi

Via arXiv

Label Alignment Regularization for Distribution Shift

Nov 27, 2022
Ehsan Imani, Guojun Zhang, Jun Luo, Pascal Poupart, Yangchen Pan

Via arXiv