Alert button
Picture for Noah Constant

Noah Constant

Alert button

Training LLMs over Neurally Compressed Text

Add code
Bookmark button
Alert button
Apr 04, 2024
Brian Lester, Jaehoon Lee, Alex Alemi, Jeffrey Pennington, Adam Roberts, Jascha Sohl-Dickstein, Noah Constant

Viaarxiv icon

Transfer Learning for Text Diffusion Models

Add code
Bookmark button
Alert button
Jan 30, 2024
Kehang Han, Kathleen Kenealy, Aditya Barua, Noah Fiedel, Noah Constant

Viaarxiv icon

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Add code
Bookmark button
Alert button
Dec 22, 2023
Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron Parisi, Abhishek Kumar, Alex Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, Kathleen Kenealy, Kevin Swersky, Kshiteej Mahajan, Laura Culp, Lechao Xiao, Maxwell L. Bileschi, Noah Constant, Roman Novak, Rosanne Liu, Tris Warkentin, Yundi Qian, Yamini Bansal, Ethan Dyer, Behnam Neyshabur, Jascha Sohl-Dickstein, Noah Fiedel

Figure 1 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 2 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 3 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 4 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Viaarxiv icon

Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?

Add code
Bookmark button
Alert button
Nov 15, 2023
C. Daniel Freeman, Laura Culp, Aaron Parisi, Maxwell L Bileschi, Gamaleldin F Elsayed, Alex Rizkowsky, Isabelle Simpson, Alex Alemi, Azade Nova, Ben Adlam, Bernd Bohnet, Gaurav Mishra, Hanie Sedghi, Igor Mordatch, Izzeddin Gur, Jaehoon Lee, JD Co-Reyes, Jeffrey Pennington, Kelvin Xu, Kevin Swersky, Kshiteej Mahajan, Lechao Xiao, Rosanne Liu, Simon Kornblith, Noah Constant, Peter J. Liu, Roman Novak, Yundi Qian, Noah Fiedel, Jascha Sohl-Dickstein

Viaarxiv icon

FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation

Add code
Bookmark button
Alert button
Oct 05, 2023
Tu Vu, Mohit Iyyer, Xuezhi Wang, Noah Constant, Jerry Wei, Jason Wei, Chris Tar, Yun-Hsuan Sung, Denny Zhou, Quoc Le, Thang Luong

Figure 1 for FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation
Figure 2 for FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation
Figure 3 for FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation
Figure 4 for FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation
Viaarxiv icon

UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining

Add code
Bookmark button
Alert button
Apr 18, 2023
Hyung Won Chung, Noah Constant, Xavier Garcia, Adam Roberts, Yi Tay, Sharan Narang, Orhan Firat

Figure 1 for UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining
Figure 2 for UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining
Figure 3 for UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining
Figure 4 for UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining
Viaarxiv icon

Character-Aware Models Improve Visual Text Rendering

Add code
Bookmark button
Alert button
Dec 20, 2022
Rosanne Liu, Dan Garrette, Chitwan Saharia, William Chan, Adam Roberts, Sharan Narang, Irina Blok, RJ Mical, Mohammad Norouzi, Noah Constant

Figure 1 for Character-Aware Models Improve Visual Text Rendering
Figure 2 for Character-Aware Models Improve Visual Text Rendering
Figure 3 for Character-Aware Models Improve Visual Text Rendering
Figure 4 for Character-Aware Models Improve Visual Text Rendering
Viaarxiv icon

FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation

Add code
Bookmark button
Alert button
Oct 01, 2022
Parker Riley, Timothy Dozat, Jan A. Botha, Xavier Garcia, Dan Garrette, Jason Riesa, Orhan Firat, Noah Constant

Figure 1 for FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation
Figure 2 for FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation
Figure 3 for FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation
Figure 4 for FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation
Viaarxiv icon

Bidirectional Language Models Are Also Few-shot Learners

Add code
Bookmark button
Alert button
Sep 29, 2022
Ajay Patel, Bryan Li, Mohammad Sadegh Rasooli, Noah Constant, Colin Raffel, Chris Callison-Burch

Figure 1 for Bidirectional Language Models Are Also Few-shot Learners
Figure 2 for Bidirectional Language Models Are Also Few-shot Learners
Figure 3 for Bidirectional Language Models Are Also Few-shot Learners
Figure 4 for Bidirectional Language Models Are Also Few-shot Learners
Viaarxiv icon

Reducing Retraining by Recycling Parameter-Efficient Prompts

Add code
Bookmark button
Alert button
Aug 10, 2022
Brian Lester, Joshua Yurtsever, Siamak Shakeri, Noah Constant

Figure 1 for Reducing Retraining by Recycling Parameter-Efficient Prompts
Figure 2 for Reducing Retraining by Recycling Parameter-Efficient Prompts
Figure 3 for Reducing Retraining by Recycling Parameter-Efficient Prompts
Figure 4 for Reducing Retraining by Recycling Parameter-Efficient Prompts
Viaarxiv icon