Daniel Gissin

Jamba-1.5: Hybrid Transformer-Mamba Models at Scale

Aug 22, 2024
Via arXiv

The Implicit Bias of Depth: How Incremental Learning Drives Generalization

Sep 26, 2019
Via arXiv

Discriminative Active Learning

Jul 15, 2019
Via arXiv