Alert button
Picture for Alex Alemi

Alex Alemi

Alert button

Training LLMs over Neurally Compressed Text

Add code
Bookmark button
Alert button
Apr 04, 2024
Brian Lester, Jaehoon Lee, Alex Alemi, Jeffrey Pennington, Adam Roberts, Jascha Sohl-Dickstein, Noah Constant

Viaarxiv icon

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Add code
Bookmark button
Alert button
Dec 22, 2023
Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron Parisi, Abhishek Kumar, Alex Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, Kathleen Kenealy, Kevin Swersky, Kshiteej Mahajan, Laura Culp, Lechao Xiao, Maxwell L. Bileschi, Noah Constant, Roman Novak, Rosanne Liu, Tris Warkentin, Yundi Qian, Yamini Bansal, Ethan Dyer, Behnam Neyshabur, Jascha Sohl-Dickstein, Noah Fiedel

Figure 1 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 2 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 3 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 4 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Viaarxiv icon

Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?

Add code
Bookmark button
Alert button
Nov 15, 2023
C. Daniel Freeman, Laura Culp, Aaron Parisi, Maxwell L Bileschi, Gamaleldin F Elsayed, Alex Rizkowsky, Isabelle Simpson, Alex Alemi, Azade Nova, Ben Adlam, Bernd Bohnet, Gaurav Mishra, Hanie Sedghi, Igor Mordatch, Izzeddin Gur, Jaehoon Lee, JD Co-Reyes, Jeffrey Pennington, Kelvin Xu, Kevin Swersky, Kshiteej Mahajan, Lechao Xiao, Rosanne Liu, Simon Kornblith, Noah Constant, Peter J. Liu, Roman Novak, Yundi Qian, Noah Fiedel, Jascha Sohl-Dickstein

Viaarxiv icon

Small-scale proxies for large-scale Transformer training instabilities

Add code
Bookmark button
Alert button
Sep 25, 2023
Mitchell Wortsman, Peter J. Liu, Lechao Xiao, Katie Everett, Alex Alemi, Ben Adlam, John D. Co-Reyes, Izzeddin Gur, Abhishek Kumar, Roman Novak, Jeffrey Pennington, Jascha Sohl-dickstein, Kelvin Xu, Jaehoon Lee, Justin Gilmer, Simon Kornblith

Figure 1 for Small-scale proxies for large-scale Transformer training instabilities
Figure 2 for Small-scale proxies for large-scale Transformer training instabilities
Figure 3 for Small-scale proxies for large-scale Transformer training instabilities
Figure 4 for Small-scale proxies for large-scale Transformer training instabilities
Viaarxiv icon

Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces

Add code
Bookmark button
Alert button
May 17, 2019
Bryan Seybold, Emily Fertig, Alex Alemi, Ian Fischer

Figure 1 for Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces
Figure 2 for Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces
Figure 3 for Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces
Figure 4 for Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces
Viaarxiv icon

Watch Your Step: Learning Node Embeddings via Graph Attention

Add code
Bookmark button
Alert button
Sep 12, 2018
Sami Abu-El-Haija, Bryan Perozzi, Rami Al-Rfou, Alex Alemi

Figure 1 for Watch Your Step: Learning Node Embeddings via Graph Attention
Figure 2 for Watch Your Step: Learning Node Embeddings via Graph Attention
Figure 3 for Watch Your Step: Learning Node Embeddings via Graph Attention
Figure 4 for Watch Your Step: Learning Node Embeddings via Graph Attention
Viaarxiv icon

TensorFlow Distributions

Add code
Bookmark button
Alert button
Nov 28, 2017
Joshua V. Dillon, Ian Langmore, Dustin Tran, Eugene Brevdo, Srinivas Vasudevan, Dave Moore, Brian Patton, Alex Alemi, Matt Hoffman, Rif A. Saurous

Figure 1 for TensorFlow Distributions
Figure 2 for TensorFlow Distributions
Figure 3 for TensorFlow Distributions
Viaarxiv icon

Motion Prediction Under Multimodality with Conditional Stochastic Networks

Add code
Bookmark button
Alert button
May 05, 2017
Katerina Fragkiadaki, Jonathan Huang, Alex Alemi, Sudheendra Vijayanarasimhan, Susanna Ricco, Rahul Sukthankar

Figure 1 for Motion Prediction Under Multimodality with Conditional Stochastic Networks
Figure 2 for Motion Prediction Under Multimodality with Conditional Stochastic Networks
Figure 3 for Motion Prediction Under Multimodality with Conditional Stochastic Networks
Figure 4 for Motion Prediction Under Multimodality with Conditional Stochastic Networks
Viaarxiv icon

Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning

Add code
Bookmark button
Alert button
Aug 23, 2016
Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, Alex Alemi

Figure 1 for Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
Figure 2 for Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
Figure 3 for Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
Figure 4 for Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
Viaarxiv icon