Alert button
Picture for Danny Halawi

Danny Halawi

Alert button

Approaching Human-Level Forecasting with Language Models

Add code
Bookmark button
Alert button
Feb 28, 2024
Danny Halawi, Fred Zhang, Chen Yueh-Han, Jacob Steinhardt

Viaarxiv icon

Overthinking the Truth: Understanding how Language Models Process False Demonstrations

Add code
Bookmark button
Alert button
Jul 18, 2023
Danny Halawi, Jean-Stanislas Denain, Jacob Steinhardt

Viaarxiv icon

Eliciting Latent Predictions from Transformers with the Tuned Lens

Add code
Bookmark button
Alert button
Mar 15, 2023
Nora Belrose, Zach Furman, Logan Smith, Danny Halawi, Igor Ostrovsky, Lev McKinney, Stella Biderman, Jacob Steinhardt

Figure 1 for Eliciting Latent Predictions from Transformers with the Tuned Lens
Figure 2 for Eliciting Latent Predictions from Transformers with the Tuned Lens
Figure 3 for Eliciting Latent Predictions from Transformers with the Tuned Lens
Figure 4 for Eliciting Latent Predictions from Transformers with the Tuned Lens
Viaarxiv icon