Samuel R. Bowman

Shammie

QuALITY: Question Answering with Long Input Texts, Yes!

Dec 16, 2021
Via arXiv

Adversarially Constructed Evaluation Sets Are More Challenging, but May Not Be Fair

Nov 16, 2021
Via arXiv

Learning with Noisy Labels by Targeted Relabeling

Oct 15, 2021
Via arXiv

When Combating Hype, Proceed with Caution

Oct 15, 2021
Via arXiv

BBQ: A Hand-Built Bias Benchmark for Question Answering

Oct 15, 2021
Via arXiv

Fine-Tuned Transformers Show Clusters of Similar Representations Across Layers

Sep 20, 2021
Via arXiv

NOPE: A Corpus of Naturally-Occurring Presuppositions in English

Sep 14, 2021
Via arXiv

Comparing Test Sets with Item Response Theory

Jun 01, 2021
Via arXiv

What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks?

Jun 01, 2021
Via arXiv

Does Putting a Linguist in the Loop Improve NLU Data Collection?

Apr 15, 2021
Via arXiv