Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Georgia Channing

Building Reliable Long-Form Generation via Hallucination Rejection Sampling

Jun 02, 2026

Lin Li, Georgia Channing, Suhaas M Bhat, Gabriel Davis Jones, Yarin Gal

Abstract:Large language models (LLMs) have achieved remarkable progress in open-ended text generation, yet they remain prone to hallucinating incorrect or unsupported content, which undermines their reliability. This issue is exacerbated in long-form generation due to hallucination snowballing, a phenomenon where early errors propagate and compound into subsequent outputs. To address this challenge, we propose a novel inference-time hallucination mitigation framework, named Segment-wise HAllucination Rejection Sampling (SHARS), which uses an arbitrary hallucination detector to identify and reject hallucinated segments during generation and resample until faithful content is produced. By retaining only confident information and building subsequent generations upon it, the framework mitigates hallucination accumulation and enhances factual consistency. To instantiate this framework, we adopt semantic uncertainty as the detector and introduce several vital modifications to address its limitations and better adapt it to long-form text. Our method enables models to self-correct hallucinations without requiring external resources such as web search or knowledge bases, while remaining compatible with them for future extensions. Empirical evaluations on standardized hallucination benchmarks demonstrate that our method substantially reduces hallucinations in long-form generation while preserving or even improving the informativeness of generation. Code is available at: https://github.com/TreeLLi/hallucination-rejection-sampling.

* accepted by ICML 2026

Via

Access Paper or Ask Questions

Contrastive Image-Metadata Pre-Training for Materials Transmission Electron Microscopy

Apr 27, 2026

Georgia Channing, Debora Keller, Marta D. Rossell, Philip Torr, Rolf Erni, Stig Helveg, Henrik Eliasson

Abstract:The vast majority of transmission electron microscopy (TEM) data never gets published and ends up on a backup drive until deleted to free up space. These left-over datasets are rich in detail and variation, often paired with automatically saved metadata of instrument state and acquisition parameters. In this work, we introduce a dataset of 7,330 high-angle annular dark-field scanning-TEM (HAADF-STEM) images from a single instrument to learn a joint embedding space between image metadata and HAADF image. These embeddings link image style with acquisition parameters, which allows us to train a generative style transfer network that can convert experimental images into the style they would have had if they were recorded with different instrument parameters. We evaluate the performance of the network and explore the usefulness of the technique for physical denoising.

Via

Access Paper or Ask Questions

PSyDUCK: Training-Free Steganography for Latent Diffusion

Jan 31, 2025

Georgia Channing, Aqib Mahfuz, Mark van der Wilk, Philip Torr, Fabio Pizzati, Christian Schroeder de Witt

Figure 1 for PSyDUCK: Training-Free Steganography for Latent Diffusion

Figure 2 for PSyDUCK: Training-Free Steganography for Latent Diffusion

Figure 3 for PSyDUCK: Training-Free Steganography for Latent Diffusion

Figure 4 for PSyDUCK: Training-Free Steganography for Latent Diffusion

Abstract:Recent advances in AI-generated steganography highlight its potential for safeguarding the privacy of vulnerable democratic actors, including aid workers, journalists, and whistleblowers operating in oppressive regimes. In this work, we address current limitations and establish the foundations for large-throughput generative steganography. We introduce a novel approach that enables secure and efficient steganography within latent diffusion models. We show empirically that our methods perform well across a variety of open-source latent diffusion models, particularly in generative image and video tasks.

Via

Access Paper or Ask Questions

Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap

Oct 09, 2024

Georgia Channing, Juil Sock, Ronald Clark, Philip Torr, Christian Schroeder de Witt

Figure 1 for Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap

Figure 2 for Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap

Figure 3 for Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap

Figure 4 for Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap

Abstract:The rapid proliferation of AI-manipulated or generated audio deepfakes poses serious challenges to media integrity and election security. Current AI-driven detection solutions lack explainability and underperform in real-world settings. In this paper, we introduce novel explainability methods for state-of-the-art transformer-based audio deepfake detectors and open-source a novel benchmark for real-world generalizability. By narrowing the explainability gap between transformer-based audio deepfake detectors and traditional methods, our results not only build trust with human experts, but also pave the way for unlocking the potential of citizen intelligence to overcome the scalability issue in audio deepfake detection.

Via

Access Paper or Ask Questions