William F. Shen

Scaling Small Agents Through Strategy Auctions

Feb 02, 2026

Training AI Co-Scientists Using Rubric Rewards

Dec 29, 2025

Balanced Accuracy: The Right Metric for Evaluating LLM Judges -- Explained through Youden's J statistic

Dec 08, 2025

Don't Make It Up: Preserving Ignorance Awareness in LLM Fine-Tuning

Jun 17, 2025

DES-LOC: Desynced Low Communication Adaptive Optimizers for Training Foundation Models

May 28, 2025

Permissioned LLMs: Enforcing Access Control in Large Language Models

May 28, 2025

Editing as Unlearning: Are Knowledge Editing Methods Strong Baselines for Large Language Model Unlearning?

May 26, 2025

LUNAR: LLM Unlearning via Neural Activation Redirection

Feb 11, 2025

DEPT: Decoupled Embeddings for Pre-training Language Models

Oct 07, 2024

PISTOL: Dataset Compilation Pipeline for Structural Unlearning of LLMs

Jun 24, 2024