Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yarin Gal

Fine-tuning can cripple your foundation model; preserving features may be the solution

Aug 25, 2023

Jishnu Mukhoti, Yarin Gal, Philip H. S. Torr, Puneet K. Dokania

Abstract:Pre-trained foundation models, owing primarily to their enormous capacity and exposure to vast amount of training data scraped from the internet, enjoy the advantage of storing knowledge about plenty of real-world concepts. Such models are typically fine-tuned on downstream datasets to produce remarkable state-of-the-art performances. While various fine-tuning methods have been devised and are shown to be highly effective, we observe that a fine-tuned model's ability to recognize concepts on tasks $\textit{different}$ from the downstream one is reduced significantly compared to its pre-trained counterpart. This is clearly undesirable as a huge amount of time and money went into learning those very concepts in the first place. We call this undesirable phenomenon "concept forgetting" and via experiments show that most end-to-end fine-tuning approaches suffer heavily from this side effect. To this end, we also propose a rather simple fix to this problem by designing a method called LDIFS (short for $\ell_2$ distance in feature space) that simply preserves the features of the original foundation model during fine-tuning. We show that LDIFS significantly reduces concept forgetting without having noticeable impact on the downstream task performance.

Via

Access Paper or Ask Questions

In-Context Learning in Large Language Models Learns Label Relationships but Is Not Conventional Learning

Aug 07, 2023

Jannik Kossen, Tom Rainforth, Yarin Gal

Figure 1 for In-Context Learning in Large Language Models Learns Label Relationships but Is Not Conventional Learning

Figure 2 for In-Context Learning in Large Language Models Learns Label Relationships but Is Not Conventional Learning

Figure 3 for In-Context Learning in Large Language Models Learns Label Relationships but Is Not Conventional Learning

Figure 4 for In-Context Learning in Large Language Models Learns Label Relationships but Is Not Conventional Learning

Abstract:The performance of Large Language Models (LLMs) on downstream tasks often improves significantly when including examples of the input-label relationship in the context. However, there is currently no consensus about how this in-context learning (ICL) ability of LLMs works: for example, while Xie et al. (2021) liken ICL to a general-purpose learning algorithm, Min et al. (2022b) argue ICL does not even learn label relationships from in-context examples. In this paper, we study (1) how labels of in-context examples affect predictions, (2) how label relationships learned during pre-training interact with input-label examples provided in-context, and (3) how ICL aggregates label information across in-context examples. Our findings suggests LLMs usually incorporate information from in-context labels, but that pre-training and in-context label relationships are treated differently, and that the model does not consider all in-context information equally. Our results give insights into understanding and aligning LLM behavior.

Via

Access Paper or Ask Questions

LLM Censorship: A Machine Learning Challenge or a Computer Security Problem?

Jul 20, 2023

David Glukhov, Ilia Shumailov, Yarin Gal, Nicolas Papernot, Vardan Papyan

Abstract:Large language models (LLMs) have exhibited impressive capabilities in comprehending complex instructions. However, their blind adherence to provided instructions has led to concerns regarding risks of malicious use. Existing defence mechanisms, such as model fine-tuning or output censorship using LLMs, have proven to be fallible, as LLMs can still generate problematic responses. Commonly employed censorship approaches treat the issue as a machine learning problem and rely on another LM to detect undesirable content in LLM outputs. In this paper, we present the theoretical limitations of such semantic censorship approaches. Specifically, we demonstrate that semantic censorship can be perceived as an undecidable problem, highlighting the inherent challenges in censorship that arise due to LLMs' programmatic and instruction-following capabilities. Furthermore, we argue that the challenges extend beyond semantic censorship, as knowledgeable attackers can reconstruct impermissible outputs from a collection of permissible ones. As a result, we propose that the problem of censorship needs to be reevaluated; it should be treated as a security problem which warrants the adaptation of security-based approaches to mitigate potential risks.

Via

Access Paper or Ask Questions

BatchGFN: Generative Flow Networks for Batch Active Learning

Jun 26, 2023

Shreshth A. Malik, Salem Lahlou, Andrew Jesson, Moksh Jain, Nikolay Malkin, Tristan Deleu, Yoshua Bengio, Yarin Gal

Figure 1 for BatchGFN: Generative Flow Networks for Batch Active Learning

Figure 2 for BatchGFN: Generative Flow Networks for Batch Active Learning

Figure 3 for BatchGFN: Generative Flow Networks for Batch Active Learning

Figure 4 for BatchGFN: Generative Flow Networks for Batch Active Learning

Abstract:We introduce BatchGFN -- a novel approach for pool-based active learning that uses generative flow networks to sample sets of data points proportional to a batch reward. With an appropriate reward function to quantify the utility of acquiring a batch, such as the joint mutual information between the batch and the model parameters, BatchGFN is able to construct highly informative batches for active learning in a principled way. We show our approach enables sampling near-optimal utility batches at inference time with a single forward pass per point in the batch in toy regression problems. This alleviates the computational complexity of batch-aware algorithms and removes the need for greedy approximations to find maximizers for the batch reward. We also present early results for amortizing training across acquisition steps, which will enable scaling to real-world tasks.

* Accepted at the Structured Probabilistic Inference & Generative Modeling workshop, ICML 2023

Via

Access Paper or Ask Questions

ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages

Jun 12, 2023

Andrew Jesson, Chris Lu, Gunshi Gupta, Angelos Filos, Jakob Nicolaus Foerster, Yarin Gal

Figure 1 for ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages

Figure 2 for ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages

Figure 3 for ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages

Figure 4 for ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages

Abstract:This paper introduces a novel method for enhancing the effectiveness of on-policy Deep Reinforcement Learning (DRL) algorithms. Three surprisingly simple modifications to the A3C algorithm: (1) processing advantage estimates through a ReLU function, (2) spectral normalization, and (3) dropout, serve to not only improve efficacy but also yield a ``cautious'' DRL algorithm. Where on-policy algorithms such as Proximal Policy Optimization (PPO) and Asynchronous Advantage Actor-Critic (A3C) do not explicitly account for cautious interaction with the environment, our method integrates caution in two critical ways: (1) by maximizing a lower bound on the value function plus a constant, thereby promoting a \textit{conservative value estimation}, and (2) by incorporating Thompson sampling for cautious exploration. In proving that our algorithm maximizes the lower bound, we also ground Regret Matching Policy Gradients (RMPG), a discrete-action on-policy method for multi-agent reinforcement learning. Our rigorous empirical evaluations across various benchmarks demonstrate our approach's improved performance against existing on-policy algorithms. This research represents a substantial step towards efficacious and cautious DRL algorithms, which are needed to unlock applications to complex, real-world problems.

Via

Access Paper or Ask Questions

The Curse of Recursion: Training on Generated Data Makes Models Forget

May 31, 2023

Ilia Shumailov, Zakhar Shumaylov, Yiren Zhao, Yarin Gal, Nicolas Papernot, Ross Anderson

Figure 1 for The Curse of Recursion: Training on Generated Data Makes Models Forget

Figure 2 for The Curse of Recursion: Training on Generated Data Makes Models Forget

Figure 3 for The Curse of Recursion: Training on Generated Data Makes Models Forget

Figure 4 for The Curse of Recursion: Training on Generated Data Makes Models Forget

Abstract:Stable Diffusion revolutionised image creation from descriptive text. GPT-2, GPT-3(.5) and GPT-4 demonstrated astonishing performance across a variety of language tasks. ChatGPT introduced such language models to the general public. It is now clear that large language models (LLMs) are here to stay, and will bring about drastic change in the whole ecosystem of online text and images. In this paper we consider what the future might hold. What will happen to GPT-{n} once LLMs contribute much of the language found online? We find that use of model-generated content in training causes irreversible defects in the resulting models, where tails of the original content distribution disappear. We refer to this effect as Model Collapse and show that it can occur in Variational Autoencoders, Gaussian Mixture Models and LLMs. We build theoretical intuition behind the phenomenon and portray its ubiquity amongst all learned generative models. We demonstrate that it has to be taken seriously if we are to sustain the benefits of training from large-scale data scraped from the web. Indeed, the value of data collected about genuine human interactions with systems will be increasingly valuable in the presence of content generated by LLMs in data crawled from the Internet.

Via

Access Paper or Ask Questions

Prediction-Oriented Bayesian Active Learning

Apr 17, 2023

Freddie Bickford Smith, Andreas Kirsch, Sebastian Farquhar, Yarin Gal, Adam Foster, Tom Rainforth

Figure 1 for Prediction-Oriented Bayesian Active Learning

Figure 2 for Prediction-Oriented Bayesian Active Learning

Figure 3 for Prediction-Oriented Bayesian Active Learning

Figure 4 for Prediction-Oriented Bayesian Active Learning

Abstract:Information-theoretic approaches to active learning have traditionally focused on maximising the information gathered about the model parameters, most commonly by optimising the BALD score. We highlight that this can be suboptimal from the perspective of predictive performance. For example, BALD lacks a notion of an input distribution and so is prone to prioritise data of limited relevance. To address this we propose the expected predictive information gain (EPIG), an acquisition function that measures information gain in the space of predictions rather than parameters. We find that using EPIG leads to stronger predictive performance compared with BALD across a range of datasets and models, and thus provides an appealing drop-in replacement.

* Published at AISTATS 2023

Via

Access Paper or Ask Questions

Revisiting Automated Prompting: Are We Actually Doing Better?

Apr 07, 2023

Yulin Zhou, Yiren Zhao, Ilia Shumailov, Robert Mullins, Yarin Gal

Abstract:Current literature demonstrates that Large Language Models (LLMs) are great few-shot learners, and prompting significantly increases their performance on a range of downstream tasks in a few-shot learning setting. An attempt to automate human-led prompting followed, with some progress achieved. In particular, subsequent work demonstrates automation can outperform fine-tuning in certain K-shot learning scenarios. In this paper, we revisit techniques for automated prompting on six different downstream tasks and a larger range of K-shot learning settings. We find that automated prompting does not consistently outperform simple manual prompts. Our work suggests that, in addition to fine-tuning, manual prompts should be used as a baseline in this line of research.

Via

Access Paper or Ask Questions

Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation

Feb 21, 2023

Lorenz Kuhn, Yarin Gal, Sebastian Farquhar

Figure 1 for Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation

Figure 2 for Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation

Figure 3 for Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation

Figure 4 for Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation

Abstract:We introduce a method to measure uncertainty in large language models. For tasks like question answering, it is essential to know when we can trust the natural language outputs of foundation models. We show that measuring uncertainty in natural language is challenging because of "semantic equivalence" -- different sentences can mean the same thing. To overcome these challenges we introduce semantic entropy -- an entropy which incorporates linguistic invariances created by shared meanings. Our method is unsupervised, uses only a single model, and requires no modifications to off-the-shelf language models. In comprehensive ablation studies we show that the semantic entropy is more predictive of model accuracy on question answering data sets than comparable baselines.

Via

Access Paper or Ask Questions

Differentiable Multi-Target Causal Bayesian Experimental Design

Feb 21, 2023

Yashas Annadani, Panagiotis Tigas, Desi R. Ivanova, Andrew Jesson, Yarin Gal, Adam Foster, Stefan Bauer

Figure 1 for Differentiable Multi-Target Causal Bayesian Experimental Design

Figure 2 for Differentiable Multi-Target Causal Bayesian Experimental Design

Figure 3 for Differentiable Multi-Target Causal Bayesian Experimental Design

Figure 4 for Differentiable Multi-Target Causal Bayesian Experimental Design

Abstract:We introduce a gradient-based approach for the problem of Bayesian optimal experimental design to learn causal models in a batch setting -- a critical component for causal discovery from finite data where interventions can be costly or risky. Existing methods rely on greedy approximations to construct a batch of experiments while using black-box methods to optimize over a single target-state pair to intervene with. In this work, we completely dispose of the black-box optimization techniques and greedy heuristics and instead propose a conceptually simple end-to-end gradient-based optimization procedure to acquire a set of optimal intervention target-state pairs. Such a procedure enables parameterization of the design space to efficiently optimize over a batch of multi-target-state interventions, a setting which has hitherto not been explored due to its complexity. We demonstrate that our proposed method outperforms baselines and existing acquisition strategies in both single-target and multi-target settings across a number of synthetic datasets.

Via

Access Paper or Ask Questions