Picture for Shaojie Bai

Shaojie Bai

Stabilizing Equilibrium Models by Jacobian Regularization

Add code
Jun 28, 2021
Figure 1 for Stabilizing Equilibrium Models by Jacobian Regularization
Figure 2 for Stabilizing Equilibrium Models by Jacobian Regularization
Figure 3 for Stabilizing Equilibrium Models by Jacobian Regularization
Figure 4 for Stabilizing Equilibrium Models by Jacobian Regularization
Viaarxiv icon

SHINE: SHaring the INverse Estimate from the forward pass for bi-level optimization and implicit models

Add code
Jun 24, 2021
Figure 1 for SHINE: SHaring the INverse Estimate from the forward pass for bi-level optimization and implicit models
Figure 2 for SHINE: SHaring the INverse Estimate from the forward pass for bi-level optimization and implicit models
Figure 3 for SHINE: SHaring the INverse Estimate from the forward pass for bi-level optimization and implicit models
Figure 4 for SHINE: SHaring the INverse Estimate from the forward pass for bi-level optimization and implicit models
Viaarxiv icon

A Note on Connecting Barlow Twins with Negative-Sample-Free Contrastive Learning

Add code
Apr 28, 2021
Figure 1 for A Note on Connecting Barlow Twins with Negative-Sample-Free Contrastive Learning
Figure 2 for A Note on Connecting Barlow Twins with Negative-Sample-Free Contrastive Learning
Viaarxiv icon

A community-powered search of machine learning strategy space to find NMR property prediction models

Add code
Aug 13, 2020
Figure 1 for A community-powered search of machine learning strategy space to find NMR property prediction models
Figure 2 for A community-powered search of machine learning strategy space to find NMR property prediction models
Figure 3 for A community-powered search of machine learning strategy space to find NMR property prediction models
Figure 4 for A community-powered search of machine learning strategy space to find NMR property prediction models
Viaarxiv icon

Multiscale Deep Equilibrium Models

Add code
Jun 15, 2020
Figure 1 for Multiscale Deep Equilibrium Models
Figure 2 for Multiscale Deep Equilibrium Models
Figure 3 for Multiscale Deep Equilibrium Models
Figure 4 for Multiscale Deep Equilibrium Models
Viaarxiv icon

Deep Equilibrium Models

Add code
Sep 03, 2019
Figure 1 for Deep Equilibrium Models
Figure 2 for Deep Equilibrium Models
Figure 3 for Deep Equilibrium Models
Figure 4 for Deep Equilibrium Models
Viaarxiv icon

Transformer Dissection: An Unified Understanding for Transformer's Attention via the Lens of Kernel

Add code
Aug 30, 2019
Figure 1 for Transformer Dissection: An Unified Understanding for Transformer's Attention via the Lens of Kernel
Figure 2 for Transformer Dissection: An Unified Understanding for Transformer's Attention via the Lens of Kernel
Figure 3 for Transformer Dissection: An Unified Understanding for Transformer's Attention via the Lens of Kernel
Figure 4 for Transformer Dissection: An Unified Understanding for Transformer's Attention via the Lens of Kernel
Viaarxiv icon

Multimodal Transformer for Unaligned Multimodal Language Sequences

Add code
Jun 01, 2019
Figure 1 for Multimodal Transformer for Unaligned Multimodal Language Sequences
Figure 2 for Multimodal Transformer for Unaligned Multimodal Language Sequences
Figure 3 for Multimodal Transformer for Unaligned Multimodal Language Sequences
Figure 4 for Multimodal Transformer for Unaligned Multimodal Language Sequences
Viaarxiv icon

Trellis Networks for Sequence Modeling

Add code
Oct 15, 2018
Figure 1 for Trellis Networks for Sequence Modeling
Figure 2 for Trellis Networks for Sequence Modeling
Figure 3 for Trellis Networks for Sequence Modeling
Figure 4 for Trellis Networks for Sequence Modeling
Viaarxiv icon

An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling

Add code
Apr 19, 2018
Figure 1 for An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
Figure 2 for An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
Figure 3 for An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
Figure 4 for An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
Viaarxiv icon