Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Forecasting Future World Events with Neural Networks


Jun 30, 2022
Andy Zou , Tristan Xiao , Ryan Jia , Joe Kwon , Mantas Mazeika , Richard Li , Dawn Song , Jacob Steinhardt , Owain Evans , Dan Hendrycks

* Code and the Autocast dataset are available at https://github.com/andyzoujm/autocast 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Auditing Visualizations: Transparency Methods Struggle to Detect Anomalous Behavior


Jun 27, 2022
Jean-Stanislas Denain , Jacob Steinhardt


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Supply-Side Equilibria in Recommender Systems


Jun 27, 2022
Meena Jagadeesan , Nikhil Garg , Jacob Steinhardt


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize


Mar 11, 2022
Alexander Wei , Wei Hu , Jacob Steinhardt


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Capturing Failures of Large Language Models via Human Cognitive Biases


Feb 24, 2022
Erik Jones , Jacob Steinhardt


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Predicting Out-of-Distribution Error with the Projection Norm


Feb 11, 2022
Yaodong Yu , Zitong Yang , Alexander Wei , Yi Ma , Jacob Steinhardt


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Summarizing Differences between Text Distributions with Natural Language


Jan 28, 2022
Ruiqi Zhong , Charlie Snell , Dan Klein , Jacob Steinhardt


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models


Jan 10, 2022
Alexander Pan , Kush Bhatia , Jacob Steinhardt

* 19 pages 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures


Dec 11, 2021
Dan Hendrycks , Andy Zou , Mantas Mazeika , Leonard Tang , Bo Li , Dawn Song , Jacob Steinhardt

* Code and models are available at https://github.com/andyzoujm/pixmix 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

The Effect of Model Size on Worst-Group Generalization


Dec 08, 2021
Alan Pham , Eunice Chan , Vikranth Srivatsa , Dhruba Ghosh , Yaoqing Yang , Yaodong Yu , Ruiqi Zhong , Joseph E. Gonzalez , Jacob Steinhardt

* The first four authors contributed equally to the work 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
3
4
5
6
>>