Sanmi Koyejo

Self-Supervised Learning of Representations for Space Generates Multi-Modular Grid Cells

Nov 04, 2023
Rylan Schaeffer, Mikail Khona, Tzuhsuan Ma, Cristóbal Eyzaguirre, Sanmi Koyejo, Ila Rani Fiete

Learning to (Learn at Test Time)

Oct 20, 2023
Yu Sun, Xinhao Li, Karan Dalal, Chloe Hsu, Sanmi Koyejo, Carlos Guestrin, Xiaolong Wang, Tatsunori Hashimoto, Xinlei Chen

Representation Engineering: A Top-Down Approach to AI Transparency

Oct 10, 2023
Andy Zou, Long Phan, Sarah Chen, James Campbell, Phillip Guo, Richard Ren, Alexander Pan, Xuwang Yin, Mantas Mazeika, Ann-Kathrin Dombrowski, Shashwat Goel, Nathaniel Li, Michael J. Byun, Zifan Wang, Alex Mallen, Steven Basart, Sanmi Koyejo, Dawn Song, Matt Fredrikson, J. Zico Kolter, Dan Hendrycks

Deceptive Alignment Monitoring

Jul 26, 2023
Andres Carranza, Dhruv Pai, Rylan Schaeffer, Arnuv Tandon, Sanmi Koyejo

Invalid Logic, Equivalent Gains: The Bizarreness of Reasoning in Language Model Prompting

Jul 23, 2023
Rylan Schaeffer, Kateryna Pistunova, Samar Khanna, Sarthak Consul, Sanmi Koyejo

FACADE: A Framework for Adversarial Circuit Anomaly Detection and Evaluation

Jul 20, 2023
Dhruv Pai, Andres Carranza, Rylan Schaeffer, Arnuv Tandon, Sanmi Koyejo

Communication-Efficient Federated Learning through Importance Sampling

Jun 25, 2023
Berivan Isik, Francesco Pase, Deniz Gunduz, Sanmi Koyejo, Tsachy Weissman, Michele Zorzi

Is Pre-training Truly Better Than Meta-Learning?

Jun 24, 2023
Brando Miranda, Patrick Yu, Saumya Goyal, Yu-Xiong Wang, Sanmi Koyejo

Beyond Scale: the Diversity Coefficient as a Data Quality Metric Demonstrates LLMs are Pre-trained on Formally Diverse Data

Jun 24, 2023
Alycia Lee, Brando Miranda, Sanmi Koyejo
