Stella Biderman

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Apr 10, 2024
Bo Peng, Daniel Goldstein, Quentin Anthony, Alon Albalak, Eric Alcaide, Stella Biderman, Eugene Cheah, Xingjian Du, Teddy Ferdinan, Haowen Hou, Przemysław Kazienko, Kranthi Kiran GV, Jan Kocoń, Bartłomiej Koptyra, Satyapriya Krishna, Ronald McClelland Jr., Niklas Muennighoff, Fares Obeid, Atsushi Saito, Guangyu Song, Haoqin Tu, Stanisław Woźniak, Ruichong Zhang, Bingchen Zhao, Qihang Zhao, Peng Zhou, Jian Zhu, Rui-Jie Zhu

Holographic Global Convolutional Networks for Long-Range Prediction Tasks in Malware Detection

Mar 23, 2024
Mohammad Mahmudul Alam, Edward Raff, Stella Biderman, Tim Oates, James Holt

On the Societal Impact of Open Foundation Models

Feb 27, 2024
Sayash Kapoor, Rishi Bommasani, Kevin Klyman, Shayne Longpre, Ashwin Ramaswami, Peter Cihon, Aspen Hopkins, Kevin Bankston, Stella Biderman, Miranda Bogen, Rumman Chowdhury, Alex Engler, Peter Henderson, Yacine Jernite, Seth Lazar, Stefano Maffulli, Alondra Nelson, Joelle Pineau, Aviya Skowron, Dawn Song, Victor Storchan, Daniel Zhang, Daniel E. Ho, Percy Liang, Arvind Narayanan

KMMLU: Measuring Massive Multitask Language Understanding in Korean

Feb 18, 2024
Guijin Son, Hanwool Lee, Sungdong Kim, Seungone Kim, Niklas Muennighoff, Taekyoon Choi, Cheonbok Park, Kang Min Yoo, Stella Biderman

Suppressing Pink Elephants with Direct Principle Feedback

Feb 13, 2024
Louis Castricato, Nathan Lile, Suraj Anand, Hailey Schoelkopf, Siddharth Verma, Stella Biderman

The Case for Co-Designing Model Architectures with Hardware

Jan 30, 2024
Quentin Anthony, Jacob Hatef, Deepak Narayanan, Stella Biderman, Stas Bekman, Junqi Yin, Aamir Shafi, Hari Subramoni, Dhabaleswar Panda

Transformer-Based Models Are Not Yet Perfect At Learning to Emulate Structural Recursion

Jan 23, 2024
Dylan Zhang, Curt Tigges, Zory Zhang, Stella Biderman, Maxim Raginsky, Talia Ringer

Grokking Group Multiplication with Cosets

Dec 11, 2023
Dashiell Stander, Qinan Yu, Honglu Fan, Stella Biderman

Llemma: An Open Language Model For Mathematics

Oct 16, 2023
Zhangir Azerbayev, Hailey Schoelkopf, Keiran Paster, Marco Dos Santos, Stephen McAleer, Albert Q. Jiang, Jia Deng, Stella Biderman, Sean Welleck

Stay on topic with Classifier-Free Guidance

Jun 30, 2023
Guillaume Sanchez, Honglu Fan, Alexander Spangher, Elad Levi, Pawan Sasanka Ammanamanchi, Stella Biderman
