Picture for Roy Schwartz

Roy Schwartz

On the Power of Saturated Transformers: A View from Circuit Complexity

Add code
Jun 30, 2021
Figure 1 for On the Power of Saturated Transformers: A View from Circuit Complexity
Figure 2 for On the Power of Saturated Transformers: A View from Circuit Complexity
Figure 3 for On the Power of Saturated Transformers: A View from Circuit Complexity
Figure 4 for On the Power of Saturated Transformers: A View from Circuit Complexity
Viaarxiv icon

Provable Limitations of Acquiring Meaning from Ungrounded Form: What will Future Language Models Understand?

Add code
Apr 22, 2021
Figure 1 for Provable Limitations of Acquiring Meaning from Ungrounded Form: What will Future Language Models Understand?
Figure 2 for Provable Limitations of Acquiring Meaning from Ungrounded Form: What will Future Language Models Understand?
Figure 3 for Provable Limitations of Acquiring Meaning from Ungrounded Form: What will Future Language Models Understand?
Figure 4 for Provable Limitations of Acquiring Meaning from Ungrounded Form: What will Future Language Models Understand?
Viaarxiv icon

Random Feature Attention

Add code
Mar 19, 2021
Figure 1 for Random Feature Attention
Figure 2 for Random Feature Attention
Figure 3 for Random Feature Attention
Figure 4 for Random Feature Attention
Viaarxiv icon

Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA

Add code
Mar 17, 2021
Figure 1 for Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA
Figure 2 for Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA
Figure 3 for Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA
Figure 4 for Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA
Viaarxiv icon

Parameter Norm Growth During Training of Transformers

Add code
Nov 11, 2020
Figure 1 for Parameter Norm Growth During Training of Transformers
Figure 2 for Parameter Norm Growth During Training of Transformers
Figure 3 for Parameter Norm Growth During Training of Transformers
Figure 4 for Parameter Norm Growth During Training of Transformers
Viaarxiv icon

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Add code
Oct 15, 2020
Figure 1 for Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics
Figure 2 for Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics
Figure 3 for Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics
Figure 4 for Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics
Viaarxiv icon

Extracting a Knowledge Base of Mechanisms from COVID-19 Papers

Add code
Oct 08, 2020
Figure 1 for Extracting a Knowledge Base of Mechanisms from COVID-19 Papers
Figure 2 for Extracting a Knowledge Base of Mechanisms from COVID-19 Papers
Figure 3 for Extracting a Knowledge Base of Mechanisms from COVID-19 Papers
Figure 4 for Extracting a Knowledge Base of Mechanisms from COVID-19 Papers
Viaarxiv icon

A Mixture of $h-1$ Heads is Better than $h$ Heads

Add code
May 13, 2020
Figure 1 for A Mixture of $h-1$ Heads is Better than $h$ Heads
Figure 2 for A Mixture of $h-1$ Heads is Better than $h$ Heads
Figure 3 for A Mixture of $h-1$ Heads is Better than $h$ Heads
Figure 4 for A Mixture of $h-1$ Heads is Better than $h$ Heads
Viaarxiv icon

The Right Tool for the Job: Matching Model and Instance Complexities

Add code
May 09, 2020
Figure 1 for The Right Tool for the Job: Matching Model and Instance Complexities
Figure 2 for The Right Tool for the Job: Matching Model and Instance Complexities
Figure 3 for The Right Tool for the Job: Matching Model and Instance Complexities
Figure 4 for The Right Tool for the Job: Matching Model and Instance Complexities
Viaarxiv icon

A Formal Hierarchy of RNN Architectures

Add code
Apr 24, 2020
Figure 1 for A Formal Hierarchy of RNN Architectures
Figure 2 for A Formal Hierarchy of RNN Architectures
Figure 3 for A Formal Hierarchy of RNN Architectures
Figure 4 for A Formal Hierarchy of RNN Architectures
Viaarxiv icon