Alert button
Picture for Michael Poli

Michael Poli

Alert button

Mechanistic Design and Scaling of Hybrid Architectures

Add code
Bookmark button
Alert button
Mar 26, 2024
Michael Poli, Armin W Thomas, Eric Nguyen, Pragaash Ponnusamy, Björn Deiseroth, Kristian Kersting, Taiji Suzuki, Brian Hie, Stefano Ermon, Christopher Ré, Ce Zhang, Stefano Massaroli

Viaarxiv icon

Zoology: Measuring and Improving Recall in Efficient Language Models

Add code
Bookmark button
Alert button
Dec 08, 2023
Simran Arora, Sabri Eyuboglu, Aman Timalsina, Isys Johnson, Michael Poli, James Zou, Atri Rudra, Christopher Ré

Viaarxiv icon

Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions

Add code
Bookmark button
Alert button
Oct 28, 2023
Stefano Massaroli, Michael Poli, Daniel Y. Fu, Hermann Kumbong, Rom N. Parnichkun, Aman Timalsina, David W. Romero, Quinn McIntyre, Beidi Chen, Atri Rudra, Ce Zhang, Christopher Re, Stefano Ermon, Yoshua Bengio

Figure 1 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Figure 2 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Figure 3 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Figure 4 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Viaarxiv icon

Learning Efficient Surrogate Dynamic Models with Graph Spline Networks

Add code
Bookmark button
Alert button
Oct 25, 2023
Chuanbo Hua, Federico Berto, Michael Poli, Stefano Massaroli, Jinkyoo Park

Viaarxiv icon

Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture

Add code
Bookmark button
Alert button
Oct 18, 2023
Daniel Y. Fu, Simran Arora, Jessica Grogan, Isys Johnson, Sabri Eyuboglu, Armin W. Thomas, Benjamin Spector, Michael Poli, Atri Rudra, Christopher Ré

Viaarxiv icon

HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution

Add code
Bookmark button
Alert button
Jun 27, 2023
Eric Nguyen, Michael Poli, Marjan Faizi, Armin Thomas, Callum Birch-Sykes, Michael Wornow, Aman Patel, Clayton Rabideau, Stefano Massaroli, Yoshua Bengio, Stefano Ermon, Stephen A. Baccus, Chris Ré

Figure 1 for HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution
Figure 2 for HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution
Figure 3 for HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution
Figure 4 for HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution
Viaarxiv icon

Ideal Abstractions for Decision-Focused Learning

Add code
Bookmark button
Alert button
Mar 29, 2023
Michael Poli, Stefano Massaroli, Stefano Ermon, Bryan Wilder, Eric Horvitz

Figure 1 for Ideal Abstractions for Decision-Focused Learning
Figure 2 for Ideal Abstractions for Decision-Focused Learning
Figure 3 for Ideal Abstractions for Decision-Focused Learning
Figure 4 for Ideal Abstractions for Decision-Focused Learning
Viaarxiv icon

Effectively Modeling Time Series with Simple Discrete State Spaces

Add code
Bookmark button
Alert button
Mar 16, 2023
Michael Zhang, Khaled K. Saab, Michael Poli, Tri Dao, Karan Goel, Christopher Ré

Figure 1 for Effectively Modeling Time Series with Simple Discrete State Spaces
Figure 2 for Effectively Modeling Time Series with Simple Discrete State Spaces
Figure 3 for Effectively Modeling Time Series with Simple Discrete State Spaces
Figure 4 for Effectively Modeling Time Series with Simple Discrete State Spaces
Viaarxiv icon

Hyena Hierarchy: Towards Larger Convolutional Language Models

Add code
Bookmark button
Alert button
Mar 06, 2023
Michael Poli, Stefano Massaroli, Eric Nguyen, Daniel Y. Fu, Tri Dao, Stephen Baccus, Yoshua Bengio, Stefano Ermon, Christopher Ré

Figure 1 for Hyena Hierarchy: Towards Larger Convolutional Language Models
Figure 2 for Hyena Hierarchy: Towards Larger Convolutional Language Models
Figure 3 for Hyena Hierarchy: Towards Larger Convolutional Language Models
Figure 4 for Hyena Hierarchy: Towards Larger Convolutional Language Models
Viaarxiv icon