
William Merrill

The Illusion of State in State-Space Models

Apr 12, 2024
William Merrill, Jackson Petty, Ashish Sabharwal

Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment

Feb 29, 2024
William Merrill, Zhaofeng Wu, Norihito Naka, Yoon Kim, Tal Linzen

OLMo: Accelerating the Science of Language Models

Feb 07, 2024
Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi

Transformers as Recognizers of Formal Languages: A Survey on Expressivity

Nov 01, 2023
Lena Strobl, William Merrill, Gail Weiss, David Chiang, Dana Angluin

The Expressive Power of Transformers with Chain of Thought

Oct 18, 2023
William Merrill, Ashish Sabharwal

The Expressive Power of Transformers with Chain of Thought

Oct 16, 2023
William Merrill, Ashish Sabharwal

How Language Model Hallucinations Can Snowball

May 22, 2023
Muru Zhang, Ofir Press, William Merrill, Alisa Liu, Noah A. Smith

A Tale of Two Circuits: Grokking as Competition of Sparse and Dense Subnetworks

Mar 21, 2023
William Merrill, Nikolaos Tsilivis, Aman Shukla
