Picture for Yonatan Belinkov

Yonatan Belinkov

Fast Forwarding Low-Rank Training

Add code
Sep 06, 2024
Figure 1 for Fast Forwarding Low-Rank Training
Figure 2 for Fast Forwarding Low-Rank Training
Figure 3 for Fast Forwarding Low-Rank Training
Figure 4 for Fast Forwarding Low-Rank Training
Viaarxiv icon

Jamba-1.5: Hybrid Transformer-Mamba Models at Scale

Add code
Aug 22, 2024
Figure 1 for Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
Figure 2 for Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
Figure 3 for Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
Figure 4 for Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
Viaarxiv icon

The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability

Add code
Aug 02, 2024
Figure 1 for The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability
Figure 2 for The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability
Figure 3 for The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability
Viaarxiv icon

Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions

Add code
Jul 21, 2024
Viaarxiv icon

Confidence Regulation Neurons in Language Models

Add code
Jun 24, 2024
Viaarxiv icon

REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space

Add code
Jun 13, 2024
Figure 1 for REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space
Figure 2 for REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space
Figure 3 for REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space
Figure 4 for REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space
Viaarxiv icon

DEPTH: Discourse Education through Pre-Training Hierarchically

Add code
May 13, 2024
Viaarxiv icon

Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs

Add code
Apr 15, 2024
Viaarxiv icon

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

Add code
Mar 31, 2024
Figure 1 for Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
Figure 2 for Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
Figure 3 for Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
Figure 4 for Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
Viaarxiv icon

Jamba: A Hybrid Transformer-Mamba Language Model

Add code
Mar 28, 2024
Figure 1 for Jamba: A Hybrid Transformer-Mamba Language Model
Figure 2 for Jamba: A Hybrid Transformer-Mamba Language Model
Figure 3 for Jamba: A Hybrid Transformer-Mamba Language Model
Figure 4 for Jamba: A Hybrid Transformer-Mamba Language Model
Viaarxiv icon