J. Zico Kolter

Representation Engineering: A Top-Down Approach to AI Transparency

Oct 10, 2023
Andy Zou, Long Phan, Sarah Chen, James Campbell, Phillip Guo, Richard Ren, Alexander Pan, Xuwang Yin, Mantas Mazeika, Ann-Kathrin Dombrowski, Shashwat Goel, Nathaniel Li, Michael J. Byun, Zifan Wang, Alex Mallen, Steven Basart, Sanmi Koyejo, Dawn Song, Matt Fredrikson, J. Zico Kolter, Dan Hendrycks

Understanding prompt engineering may not require rethinking generalization

Oct 06, 2023
Victor Akinwande, Yiding Jiang, Dylan Sam, J. Zico Kolter

Universal and Transferable Adversarial Attacks on Aligned Language Models

Jul 27, 2023
Andy Zou, Zifan Wang, J. Zico Kolter, Matt Fredrikson

Can Neural Network Memorization Be Localized?

Jul 18, 2023
Pratyush Maini, Michael C. Mozer, Hanie Sedghi, Zachary C. Lipton, J. Zico Kolter, Chiyuan Zhang

Monotone deep Boltzmann machines

Jul 11, 2023
Zhili Feng, Ezra Winston, J. Zico Kolter

Leveraging Multiple Descriptive Features for Robust Few-shot Image Learning

Jul 10, 2023
Zhili Feng, Anna Bair, J. Zico Kolter

T-MARS: Improving Visual Representations by Circumventing Text Feature Learning

Jul 06, 2023
Pratyush Maini, Sachin Goyal, Zachary C. Lipton, J. Zico Kolter, Aditi Raghunathan

Localized Text-to-Image Generation for Free via Cross Attention Control

Jun 26, 2023
Yutong He, Ruslan Salakhutdinov, J. Zico Kolter

A Simple and Effective Pruning Approach for Large Language Models

Jun 20, 2023
Mingjie Sun, Zhuang Liu, Anna Bair, J. Zico Kolter

On the Importance of Exploration for Generalization in Reinforcement Learning

Jun 08, 2023
Yiding Jiang, J. Zico Kolter, Roberta Raileanu
