Alert button
Picture for Jimmy Ba

Jimmy Ba

Alert button

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

Add code
Bookmark button
Alert button
Mar 06, 2024
Nathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Long Phan, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew B. Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Adam Khoja, Zhenqi Zhao, Ariel Herbert-Voss, Cort B. Breuer, Andy Zou, Mantas Mazeika, Zifan Wang, Palash Oswal, Weiran Liu, Adam A. Hunt, Justin Tienken-Harder, Kevin Y. Shih, Kemper Talley, John Guan, Russell Kaplan, Ian Steneker, David Campbell, Brad Jokubaitis, Alex Levinson, Jean Wang, William Qian, Kallol Krishna Karmakar, Steven Basart, Stephen Fitz, Mindy Levine, Ponnurangam Kumaraguru, Uday Tupakula, Vijay Varadharajan, Yan Shoshitaishvili, Jimmy Ba, Kevin M. Esvelt, Alexandr Wang, Dan Hendrycks

Figure 1 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Figure 2 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Figure 3 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Figure 4 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Viaarxiv icon

Using Large Language Models for Hyperparameter Optimization

Add code
Bookmark button
Alert button
Dec 07, 2023
Michael R. Zhang, Nishkrit Desai, Juhan Bae, Jonathan Lorraine, Jimmy Ba

Viaarxiv icon

OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text

Add code
Bookmark button
Alert button
Oct 10, 2023
Keiran Paster, Marco Dos Santos, Zhangir Azerbayev, Jimmy Ba

Figure 1 for OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text
Figure 2 for OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text
Figure 3 for OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text
Figure 4 for OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text
Viaarxiv icon

Identifying the Risks of LM Agents with an LM-Emulated Sandbox

Add code
Bookmark button
Alert button
Sep 25, 2023
Yangjun Ruan, Honghua Dong, Andrew Wang, Silviu Pitis, Yongchao Zhou, Jimmy Ba, Yann Dubois, Chris J. Maddison, Tatsunori Hashimoto

Figure 1 for Identifying the Risks of LM Agents with an LM-Emulated Sandbox
Figure 2 for Identifying the Risks of LM Agents with an LM-Emulated Sandbox
Figure 3 for Identifying the Risks of LM Agents with an LM-Emulated Sandbox
Figure 4 for Identifying the Risks of LM Agents with an LM-Emulated Sandbox
Viaarxiv icon

STEVE-1: A Generative Model for Text-to-Behavior in Minecraft

Add code
Bookmark button
Alert button
Jun 05, 2023
Shalev Lifshitz, Keiran Paster, Harris Chan, Jimmy Ba, Sheila McIlraith

Figure 1 for STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
Figure 2 for STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
Figure 3 for STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
Figure 4 for STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
Viaarxiv icon

Training on Thin Air: Improve Image Classification with Generated Data

Add code
Bookmark button
Alert button
May 24, 2023
Yongchao Zhou, Hshmat Sahak, Jimmy Ba

Figure 1 for Training on Thin Air: Improve Image Classification with Generated Data
Figure 2 for Training on Thin Air: Improve Image Classification with Generated Data
Figure 3 for Training on Thin Air: Improve Image Classification with Generated Data
Figure 4 for Training on Thin Air: Improve Image Classification with Generated Data
Viaarxiv icon

AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback

Add code
Bookmark button
Alert button
May 22, 2023
Yann Dubois, Xuechen Li, Rohan Taori, Tianyi Zhang, Ishaan Gulrajani, Jimmy Ba, Carlos Guestrin, Percy Liang, Tatsunori B. Hashimoto

Figure 1 for AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback
Figure 2 for AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback
Figure 3 for AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback
Figure 4 for AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback
Viaarxiv icon

Clinical Camel: An Open-Source Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding

Add code
Bookmark button
Alert button
May 19, 2023
Augustin Toma, Patrick R. Lawler, Jimmy Ba, Rahul G. Krishnan, Barry B. Rubin, Bo Wang

Figure 1 for Clinical Camel: An Open-Source Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding
Figure 2 for Clinical Camel: An Open-Source Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding
Figure 3 for Clinical Camel: An Open-Source Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding
Figure 4 for Clinical Camel: An Open-Source Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding
Viaarxiv icon

Residual Prompt Tuning: Improving Prompt Tuning with Residual Reparameterization

Add code
Bookmark button
Alert button
May 06, 2023
Anastasia Razdaibiedina, Yuning Mao, Rui Hou, Madian Khabsa, Mike Lewis, Jimmy Ba, Amjad Almahairi

Figure 1 for Residual Prompt Tuning: Improving Prompt Tuning with Residual Reparameterization
Figure 2 for Residual Prompt Tuning: Improving Prompt Tuning with Residual Reparameterization
Figure 3 for Residual Prompt Tuning: Improving Prompt Tuning with Residual Reparameterization
Figure 4 for Residual Prompt Tuning: Improving Prompt Tuning with Residual Reparameterization
Viaarxiv icon

TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation

Add code
Bookmark button
Alert button
Apr 26, 2023
Zhaoyan Liu, Noel Vouitsis, Satya Krishna Gorti, Jimmy Ba, Gabriel Loaiza-Ganem

Figure 1 for TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation
Figure 2 for TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation
Figure 3 for TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation
Figure 4 for TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation
Viaarxiv icon