Picture for Chirag Nagpal

Chirag Nagpal

Understanding challenges to the interpretation of disaggregated evaluations of algorithmic fairness

Add code
Jun 04, 2025
Viaarxiv icon

Utility-inspired Reward Transformations Improve Reinforcement Learning Training of Language Models

Add code
Jan 08, 2025
Viaarxiv icon

InfAlign: Inference-aware language model alignment

Add code
Dec 27, 2024
Viaarxiv icon

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

Add code
Oct 10, 2024
Figure 1 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 2 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 3 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 4 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Viaarxiv icon

Robust Preference Optimization through Reward Model Distillation

Add code
May 29, 2024
Viaarxiv icon

A Toolbox for Surfacing Health Equity Harms and Biases in Large Language Models

Add code
Mar 18, 2024
Viaarxiv icon

The Case for Globalizing Fairness: A Mixed Methods Study on Colonialism, AI, and Health in Africa

Add code
Mar 11, 2024
Figure 1 for The Case for Globalizing Fairness: A Mixed Methods Study on Colonialism, AI, and Health in Africa
Figure 2 for The Case for Globalizing Fairness: A Mixed Methods Study on Colonialism, AI, and Health in Africa
Viaarxiv icon

Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation

Add code
Feb 20, 2024
Figure 1 for Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation
Viaarxiv icon

Transforming and Combining Rewards for Aligning Large Language Models

Add code
Feb 01, 2024
Viaarxiv icon

Theoretical guarantees on the best-of-n alignment policy

Add code
Jan 03, 2024
Figure 1 for Theoretical guarantees on the best-of-n alignment policy
Figure 2 for Theoretical guarantees on the best-of-n alignment policy
Figure 3 for Theoretical guarantees on the best-of-n alignment policy
Viaarxiv icon