Zora Che

EnsemW2S: Enhancing Weak-to-Strong Generalization with Large Language Model Ensembles

May 28, 2025

Effort-aware Fairness: Incorporating a Philosophy-informed, Human-centered Notion of Effort into Algorithmic Fairness Metrics

May 25, 2025

AegisLLM: Scaling Agentic Systems for Self-Reflective Defense in LLM Security

Apr 29, 2025

PoisonedParrot: Subtle Data Poisoning Attacks to Elicit Copyright-Infringing Content from Large Language Models

Mar 10, 2025

EnsemW2S: Can an Ensemble of LLMs be Leveraged to Obtain a Stronger LLM?

Oct 06, 2024

Auction-Based Regulation for Artificial Intelligence

Oct 02, 2024

Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?

Jul 24, 2024

SAIL: Self-Improving Efficient Online Alignment of Large Language Models

Jun 21, 2024

Transferring Fairness under Distribution Shifts via Fair Consistency Regularization

Jun 26, 2022