Alert button
Picture for Jacob Steinhardt

Jacob Steinhardt

Alert button

How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios

Add code
Bookmark button
Alert button
Oct 18, 2022
Mantas Mazeika, Eric Tang, Andy Zou, Steven Basart, Jun Shern Chan, Dawn Song, David Forsyth, Jacob Steinhardt, Dan Hendrycks

Figure 1 for How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios
Figure 2 for How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios
Figure 3 for How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios
Figure 4 for How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios
Viaarxiv icon

Forecasting Future World Events with Neural Networks

Add code
Bookmark button
Alert button
Jun 30, 2022
Andy Zou, Tristan Xiao, Ryan Jia, Joe Kwon, Mantas Mazeika, Richard Li, Dawn Song, Jacob Steinhardt, Owain Evans, Dan Hendrycks

Figure 1 for Forecasting Future World Events with Neural Networks
Figure 2 for Forecasting Future World Events with Neural Networks
Figure 3 for Forecasting Future World Events with Neural Networks
Figure 4 for Forecasting Future World Events with Neural Networks
Viaarxiv icon

Auditing Visualizations: Transparency Methods Struggle to Detect Anomalous Behavior

Add code
Bookmark button
Alert button
Jun 27, 2022
Jean-Stanislas Denain, Jacob Steinhardt

Figure 1 for Auditing Visualizations: Transparency Methods Struggle to Detect Anomalous Behavior
Figure 2 for Auditing Visualizations: Transparency Methods Struggle to Detect Anomalous Behavior
Figure 3 for Auditing Visualizations: Transparency Methods Struggle to Detect Anomalous Behavior
Figure 4 for Auditing Visualizations: Transparency Methods Struggle to Detect Anomalous Behavior
Viaarxiv icon

Supply-Side Equilibria in Recommender Systems

Add code
Bookmark button
Alert button
Jun 27, 2022
Meena Jagadeesan, Nikhil Garg, Jacob Steinhardt

Figure 1 for Supply-Side Equilibria in Recommender Systems
Figure 2 for Supply-Side Equilibria in Recommender Systems
Figure 3 for Supply-Side Equilibria in Recommender Systems
Viaarxiv icon

More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize

Add code
Bookmark button
Alert button
Mar 11, 2022
Alexander Wei, Wei Hu, Jacob Steinhardt

Figure 1 for More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize
Figure 2 for More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize
Figure 3 for More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize
Figure 4 for More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize
Viaarxiv icon

Capturing Failures of Large Language Models via Human Cognitive Biases

Add code
Bookmark button
Alert button
Feb 24, 2022
Erik Jones, Jacob Steinhardt

Figure 1 for Capturing Failures of Large Language Models via Human Cognitive Biases
Figure 2 for Capturing Failures of Large Language Models via Human Cognitive Biases
Figure 3 for Capturing Failures of Large Language Models via Human Cognitive Biases
Figure 4 for Capturing Failures of Large Language Models via Human Cognitive Biases
Viaarxiv icon

Predicting Out-of-Distribution Error with the Projection Norm

Add code
Bookmark button
Alert button
Feb 11, 2022
Yaodong Yu, Zitong Yang, Alexander Wei, Yi Ma, Jacob Steinhardt

Figure 1 for Predicting Out-of-Distribution Error with the Projection Norm
Figure 2 for Predicting Out-of-Distribution Error with the Projection Norm
Figure 3 for Predicting Out-of-Distribution Error with the Projection Norm
Figure 4 for Predicting Out-of-Distribution Error with the Projection Norm
Viaarxiv icon

Summarizing Differences between Text Distributions with Natural Language

Add code
Bookmark button
Alert button
Jan 28, 2022
Ruiqi Zhong, Charlie Snell, Dan Klein, Jacob Steinhardt

Figure 1 for Summarizing Differences between Text Distributions with Natural Language
Figure 2 for Summarizing Differences between Text Distributions with Natural Language
Figure 3 for Summarizing Differences between Text Distributions with Natural Language
Figure 4 for Summarizing Differences between Text Distributions with Natural Language
Viaarxiv icon

The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models

Add code
Bookmark button
Alert button
Jan 10, 2022
Alexander Pan, Kush Bhatia, Jacob Steinhardt

Figure 1 for The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models
Figure 2 for The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models
Figure 3 for The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models
Figure 4 for The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models
Viaarxiv icon

PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures

Add code
Bookmark button
Alert button
Dec 11, 2021
Dan Hendrycks, Andy Zou, Mantas Mazeika, Leonard Tang, Bo Li, Dawn Song, Jacob Steinhardt

Figure 1 for PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures
Figure 2 for PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures
Figure 3 for PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures
Figure 4 for PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures
Viaarxiv icon