Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework

Mar 13, 2024
Jingling Li, Zeyu Tang, Xiaoyu Liu, Peter Spirtes, Kun Zhang, Liu Leqi, Yang Liu

Figures 1-4 for Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework

View paper on arXiv