David Krueger

Influence Functions for Scalable Data Attribution in Diffusion Models

Oct 17, 2024
Via arXiv

Analyzing (In)Abilities of SAEs via Formal Languages

Oct 15, 2024
Via arXiv

PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning

Oct 11, 2024
Via arXiv

Sparse Autoencoders Reveal Universal Feature Spaces Across Large Language Models

Oct 09, 2024
Via arXiv

Towards Interpreting Visual Information Processing in Vision-Language Models

Oct 09, 2024
Via arXiv

Exploring the design space of deep-learning-based weather forecasting systems

Oct 09, 2024
Via arXiv

Permissive Information-Flow Analysis for Large Language Models

Oct 04, 2024
Via arXiv

Input Space Mode Connectivity in Deep Neural Networks

Sep 09, 2024
Via arXiv

Protecting against simultaneous data poisoning attacks

Aug 23, 2024
Via arXiv

A deeper look at depth pruning of LLMs

Jul 23, 2024
Via arXiv