Picture for Chris Cundy

Chris Cundy

Preference Learning with Lie Detectors can Induce Honesty or Evasion

Add code
May 20, 2025
Viaarxiv icon

Sharpe Ratio-Guided Active Learning for Preference Optimization in RLHF

Add code
Mar 28, 2025
Viaarxiv icon

No, of course I can! Refusal Mechanisms Can Be Exploited Using Harmless Fine-Tuning Data

Add code
Feb 26, 2025
Viaarxiv icon

SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking

Add code
Jun 19, 2023
Viaarxiv icon

On the Opportunities and Challenges of Foundation Models for Geospatial Artificial Intelligence

Add code
Apr 13, 2023
Viaarxiv icon

LMPriors: Pre-Trained Language Models as Task-Specific Priors

Add code
Oct 22, 2022
Viaarxiv icon

Beyond Bayes-optimality: meta-learning what you know you don't know

Add code
Oct 12, 2022
Figure 1 for Beyond Bayes-optimality: meta-learning what you know you don't know
Figure 2 for Beyond Bayes-optimality: meta-learning what you know you don't know
Figure 3 for Beyond Bayes-optimality: meta-learning what you know you don't know
Figure 4 for Beyond Bayes-optimality: meta-learning what you know you don't know
Viaarxiv icon

BCD Nets: Scalable Variational Approaches for Bayesian Causal Discovery

Add code
Dec 06, 2021
Figure 1 for BCD Nets: Scalable Variational Approaches for Bayesian Causal Discovery
Figure 2 for BCD Nets: Scalable Variational Approaches for Bayesian Causal Discovery
Figure 3 for BCD Nets: Scalable Variational Approaches for Bayesian Causal Discovery
Figure 4 for BCD Nets: Scalable Variational Approaches for Bayesian Causal Discovery
Viaarxiv icon

IQ-Learn: Inverse soft-Q Learning for Imitation

Add code
Jun 23, 2021
Figure 1 for IQ-Learn: Inverse soft-Q Learning for Imitation
Figure 2 for IQ-Learn: Inverse soft-Q Learning for Imitation
Figure 3 for IQ-Learn: Inverse soft-Q Learning for Imitation
Figure 4 for IQ-Learn: Inverse soft-Q Learning for Imitation
Viaarxiv icon

Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients

Add code
Jan 02, 2021
Figure 1 for Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients
Figure 2 for Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients
Figure 3 for Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients
Figure 4 for Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients
Viaarxiv icon