Picture for Dennis Wei

Dennis Wei

Selective Explanations

Add code
May 29, 2024
Viaarxiv icon

The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers

Add code
Apr 03, 2024
Figure 1 for The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers
Figure 2 for The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers
Figure 3 for The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers
Figure 4 for The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers
Viaarxiv icon

Multi-Level Explanations for Generative Language Models

Add code
Mar 21, 2024
Figure 1 for Multi-Level Explanations for Generative Language Models
Figure 2 for Multi-Level Explanations for Generative Language Models
Figure 3 for Multi-Level Explanations for Generative Language Models
Figure 4 for Multi-Level Explanations for Generative Language Models
Viaarxiv icon

Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

Add code
Mar 09, 2024
Figure 1 for Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations
Figure 2 for Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations
Figure 3 for Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations
Figure 4 for Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations
Viaarxiv icon

Causal Bandits with General Causal Models and Interventions

Add code
Mar 01, 2024
Figure 1 for Causal Bandits with General Causal Models and Interventions
Figure 2 for Causal Bandits with General Causal Models and Interventions
Figure 3 for Causal Bandits with General Causal Models and Interventions
Figure 4 for Causal Bandits with General Causal Models and Interventions
Viaarxiv icon

Trust Regions for Explanations via Black-Box Probabilistic Certification

Add code
Feb 21, 2024
Figure 1 for Trust Regions for Explanations via Black-Box Probabilistic Certification
Figure 2 for Trust Regions for Explanations via Black-Box Probabilistic Certification
Figure 3 for Trust Regions for Explanations via Black-Box Probabilistic Certification
Figure 4 for Trust Regions for Explanations via Black-Box Probabilistic Certification
Viaarxiv icon

Effective Human-AI Teams via Learned Natural Language Rules and Onboarding

Add code
Nov 07, 2023
Viaarxiv icon

SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation

Add code
Oct 19, 2023
Figure 1 for SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation
Figure 2 for SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation
Figure 3 for SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation
Figure 4 for SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation
Viaarxiv icon

Interpretable Differencing of Machine Learning Models

Add code
Jun 13, 2023
Viaarxiv icon

Convex Bounds on the Softmax Function with Applications to Robustness Verification

Add code
Mar 03, 2023
Viaarxiv icon