Picture for Chirag Nagpal

Chirag Nagpal

Robust Preference Optimization through Reward Model Distillation

Add code
May 29, 2024
Viaarxiv icon

A Toolbox for Surfacing Health Equity Harms and Biases in Large Language Models

Add code
Mar 18, 2024
Figure 1 for A Toolbox for Surfacing Health Equity Harms and Biases in Large Language Models
Figure 2 for A Toolbox for Surfacing Health Equity Harms and Biases in Large Language Models
Figure 3 for A Toolbox for Surfacing Health Equity Harms and Biases in Large Language Models
Figure 4 for A Toolbox for Surfacing Health Equity Harms and Biases in Large Language Models
Viaarxiv icon

The Case for Globalizing Fairness: A Mixed Methods Study on Colonialism, AI, and Health in Africa

Add code
Mar 11, 2024
Figure 1 for The Case for Globalizing Fairness: A Mixed Methods Study on Colonialism, AI, and Health in Africa
Figure 2 for The Case for Globalizing Fairness: A Mixed Methods Study on Colonialism, AI, and Health in Africa
Figure 3 for The Case for Globalizing Fairness: A Mixed Methods Study on Colonialism, AI, and Health in Africa
Figure 4 for The Case for Globalizing Fairness: A Mixed Methods Study on Colonialism, AI, and Health in Africa
Viaarxiv icon

Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation

Add code
Feb 20, 2024
Figure 1 for Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation
Viaarxiv icon

Transforming and Combining Rewards for Aligning Large Language Models

Add code
Feb 01, 2024
Viaarxiv icon

Theoretical guarantees on the best-of-n alignment policy

Add code
Jan 03, 2024
Viaarxiv icon

Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking

Add code
Dec 21, 2023
Viaarxiv icon

Recovering Sparse and Interpretable Subgroups with Heterogeneous Treatment Effects with Censored Time-to-Event Outcomes

Add code
Feb 24, 2023
Figure 1 for Recovering Sparse and Interpretable Subgroups with Heterogeneous Treatment Effects with Censored Time-to-Event Outcomes
Figure 2 for Recovering Sparse and Interpretable Subgroups with Heterogeneous Treatment Effects with Censored Time-to-Event Outcomes
Figure 3 for Recovering Sparse and Interpretable Subgroups with Heterogeneous Treatment Effects with Censored Time-to-Event Outcomes
Figure 4 for Recovering Sparse and Interpretable Subgroups with Heterogeneous Treatment Effects with Censored Time-to-Event Outcomes
Viaarxiv icon

Participatory Systems for Personalized Prediction

Add code
Feb 08, 2023
Figure 1 for Participatory Systems for Personalized Prediction
Figure 2 for Participatory Systems for Personalized Prediction
Figure 3 for Participatory Systems for Personalized Prediction
Figure 4 for Participatory Systems for Personalized Prediction
Viaarxiv icon

auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event Data

Add code
Apr 15, 2022
Figure 1 for auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event Data
Figure 2 for auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event Data
Figure 3 for auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event Data
Figure 4 for auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event Data
Viaarxiv icon