Jessica Hoffmann

Think Before You Lie: How Reasoning Leads to Honesty

Mar 16, 2026

Think Before You Lie: How Reasoning Improves Honesty

Mar 10, 2026

On Mitigating Affinity Bias through Bandits with Evolving Biased Feedback

Mar 07, 2025

Improving Neutral Point of View Text Generation through Parameter-Efficient Reinforcement Learning and a Small-Scale High-Quality Dataset

Mar 05, 2025

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Mar 15, 2024

Detecting Hallucination and Coverage Errors in Retrieval Augmented Generation for Controversial Topics

Mar 13, 2024

Decoding-time Realignment of Language Models

Feb 05, 2024

Towards Agile Text Classifiers for Everyone

Feb 13, 2023

Fairness for Image Generation with Uncertain Sensitive Attributes

Jul 02, 2021

Adversarial Graph Embeddings for Fair Influence Maximization over Social Networks

May 11, 2020