Josh Susskind

Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization

Jan 29, 2024
Yuhang Zang, Hanlin Goh, Josh Susskind, Chen Huang

What Algorithms can Transformers Learn? A Study in Length Generalization

Oct 24, 2023
Hattie Zhou, Arwen Bradley, Etai Littwin, Noam Razin, Omid Saremi, Josh Susskind, Samy Bengio, Preetum Nakkiran

Matryoshka Diffusion Models

Oct 23, 2023
Jiatao Gu, Shuangfei Zhai, Yizhe Zhang, Josh Susskind, Navdeep Jaitly

Adaptivity and Modularity for Efficient Generalization Over Task Complexity

Oct 13, 2023
Samira Abnar, Omid Saremi, Laurent Dinh, Shantel Wilson, Miguel Angel Bautista, Chen Huang, Vimal Thilak, Etai Littwin, Jiatao Gu, Josh Susskind, Samy Bengio

Generative Modeling with Phase Stochastic Bridges

Oct 13, 2023
Tianrong Chen, Jiatao Gu, Laurent Dinh, Evangelos A. Theodorou, Josh Susskind, Shuangfei Zhai

Boolformer: Symbolic Regression of Logic Functions with Transformers

Sep 21, 2023
Stéphane d'Ascoli, Samy Bengio, Josh Susskind, Emmanuel Abbé

Construction of Paired Knowledge Graph-Text Datasets Informed by Cyclic Evaluation

Sep 20, 2023
Ali Mousavi, Xin Zhan, He Bai, Peng Shi, Theo Rekatsinas, Benjamin Han, Yunyao Li, Jeff Pound, Josh Susskind, Natalie Schluter, Ihab Ilyas, Navdeep Jaitly

Value function estimation using conditional diffusion models for control

Jun 09, 2023
Bogdan Mazoure, Walter Talbott, Miguel Angel Bautista, Devon Hjelm, Alexander Toshev, Josh Susskind

BOOT: Data-free Distillation of Denoising Diffusion Models with Bootstrapping

Jun 08, 2023
Jiatao Gu, Shuangfei Zhai, Yizhe Zhang, Lingjie Liu, Josh Susskind

PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model

Jun 05, 2023
Yizhe Zhang, Jiatao Gu, Zhuofeng Wu, Shuangfei Zhai, Josh Susskind, Navdeep Jaitly
