Picture for Matan Halevy

Matan Halevy

Hack-Verifiable Environments: Towards Evaluating Reward Hacking at Scale

Add code
May 20, 2026
Viaarxiv icon

Mitigating Racial Biases in Toxic Language Detection with an Equity-Based Ensemble Framework

Add code
Sep 27, 2021
Figure 1 for Mitigating Racial Biases in Toxic Language Detection with an Equity-Based Ensemble Framework
Figure 2 for Mitigating Racial Biases in Toxic Language Detection with an Equity-Based Ensemble Framework
Figure 3 for Mitigating Racial Biases in Toxic Language Detection with an Equity-Based Ensemble Framework
Figure 4 for Mitigating Racial Biases in Toxic Language Detection with an Equity-Based Ensemble Framework
Viaarxiv icon