Picture for Karina Nguyen

Karina Nguyen

Evaluating and Mitigating Discrimination in Language Model Decisions

Add code
Dec 06, 2023
Figure 1 for Evaluating and Mitigating Discrimination in Language Model Decisions
Figure 2 for Evaluating and Mitigating Discrimination in Language Model Decisions
Figure 3 for Evaluating and Mitigating Discrimination in Language Model Decisions
Figure 4 for Evaluating and Mitigating Discrimination in Language Model Decisions
Viaarxiv icon

Specific versus General Principles for Constitutional AI

Add code
Oct 20, 2023
Figure 1 for Specific versus General Principles for Constitutional AI
Figure 2 for Specific versus General Principles for Constitutional AI
Figure 3 for Specific versus General Principles for Constitutional AI
Figure 4 for Specific versus General Principles for Constitutional AI
Viaarxiv icon

Studying Large Language Model Generalization with Influence Functions

Add code
Aug 07, 2023
Figure 1 for Studying Large Language Model Generalization with Influence Functions
Figure 2 for Studying Large Language Model Generalization with Influence Functions
Figure 3 for Studying Large Language Model Generalization with Influence Functions
Figure 4 for Studying Large Language Model Generalization with Influence Functions
Viaarxiv icon

Question Decomposition Improves the Faithfulness of Model-Generated Reasoning

Add code
Jul 25, 2023
Figure 1 for Question Decomposition Improves the Faithfulness of Model-Generated Reasoning
Figure 2 for Question Decomposition Improves the Faithfulness of Model-Generated Reasoning
Figure 3 for Question Decomposition Improves the Faithfulness of Model-Generated Reasoning
Figure 4 for Question Decomposition Improves the Faithfulness of Model-Generated Reasoning
Viaarxiv icon

Measuring Faithfulness in Chain-of-Thought Reasoning

Add code
Jul 17, 2023
Figure 1 for Measuring Faithfulness in Chain-of-Thought Reasoning
Figure 2 for Measuring Faithfulness in Chain-of-Thought Reasoning
Figure 3 for Measuring Faithfulness in Chain-of-Thought Reasoning
Figure 4 for Measuring Faithfulness in Chain-of-Thought Reasoning
Viaarxiv icon

Vision Transformers for Mobile Applications: A Short Survey

Add code
May 30, 2023
Figure 1 for Vision Transformers for Mobile Applications: A Short Survey
Figure 2 for Vision Transformers for Mobile Applications: A Short Survey
Figure 3 for Vision Transformers for Mobile Applications: A Short Survey
Viaarxiv icon

FAIR-Ensemble: When Fairness Naturally Emerges From Deep Ensembling

Add code
Mar 01, 2023
Figure 1 for FAIR-Ensemble: When Fairness Naturally Emerges From Deep Ensembling
Figure 2 for FAIR-Ensemble: When Fairness Naturally Emerges From Deep Ensembling
Figure 3 for FAIR-Ensemble: When Fairness Naturally Emerges From Deep Ensembling
Figure 4 for FAIR-Ensemble: When Fairness Naturally Emerges From Deep Ensembling
Viaarxiv icon

The Capacity for Moral Self-Correction in Large Language Models

Add code
Feb 18, 2023
Figure 1 for The Capacity for Moral Self-Correction in Large Language Models
Figure 2 for The Capacity for Moral Self-Correction in Large Language Models
Figure 3 for The Capacity for Moral Self-Correction in Large Language Models
Figure 4 for The Capacity for Moral Self-Correction in Large Language Models
Viaarxiv icon

Discovering Language Model Behaviors with Model-Written Evaluations

Add code
Dec 19, 2022
Figure 1 for Discovering Language Model Behaviors with Model-Written Evaluations
Figure 2 for Discovering Language Model Behaviors with Model-Written Evaluations
Figure 3 for Discovering Language Model Behaviors with Model-Written Evaluations
Figure 4 for Discovering Language Model Behaviors with Model-Written Evaluations
Viaarxiv icon