Picture for Zhiwei Steven Wu

Zhiwei Steven Wu

Position: LLM Unlearning Benchmarks are Weak Measures of Progress

Add code
Oct 03, 2024
Figure 1 for Position: LLM Unlearning Benchmarks are Weak Measures of Progress
Figure 2 for Position: LLM Unlearning Benchmarks are Weak Measures of Progress
Figure 3 for Position: LLM Unlearning Benchmarks are Weak Measures of Progress
Figure 4 for Position: LLM Unlearning Benchmarks are Weak Measures of Progress
Viaarxiv icon

Multi-group Uncertainty Quantification for Long-form Text Generation

Add code
Jul 25, 2024
Viaarxiv icon

Jogging the Memory of Unlearned Model Through Targeted Relearning Attack

Add code
Jun 19, 2024
Viaarxiv icon

Multi-Agent Imitation Learning: Value is Easy, Regret is Hard

Add code
Jun 06, 2024
Figure 1 for Multi-Agent Imitation Learning: Value is Easy, Regret is Hard
Figure 2 for Multi-Agent Imitation Learning: Value is Easy, Regret is Hard
Figure 3 for Multi-Agent Imitation Learning: Value is Easy, Regret is Hard
Figure 4 for Multi-Agent Imitation Learning: Value is Easy, Regret is Hard
Viaarxiv icon

Orthogonal Causal Calibration

Add code
Jun 04, 2024
Viaarxiv icon

Bridging Multicalibration and Out-of-distribution Generalization Beyond Covariate Shift

Add code
Jun 02, 2024
Figure 1 for Bridging Multicalibration and Out-of-distribution Generalization Beyond Covariate Shift
Figure 2 for Bridging Multicalibration and Out-of-distribution Generalization Beyond Covariate Shift
Figure 3 for Bridging Multicalibration and Out-of-distribution Generalization Beyond Covariate Shift
Figure 4 for Bridging Multicalibration and Out-of-distribution Generalization Beyond Covariate Shift
Viaarxiv icon

Reconciling Model Multiplicity for Downstream Decision Making

Add code
May 30, 2024
Viaarxiv icon

Reconstruction Attacks on Machine Unlearning: Simple Models are Vulnerable

Add code
May 30, 2024
Viaarxiv icon

Predictive Performance Comparison of Decision Policies Under Confounding

Add code
Apr 01, 2024
Figure 1 for Predictive Performance Comparison of Decision Policies Under Confounding
Figure 2 for Predictive Performance Comparison of Decision Policies Under Confounding
Figure 3 for Predictive Performance Comparison of Decision Policies Under Confounding
Figure 4 for Predictive Performance Comparison of Decision Policies Under Confounding
Viaarxiv icon

Provable Multi-Party Reinforcement Learning with Diverse Human Feedback

Add code
Mar 08, 2024
Viaarxiv icon