Tristan Thrush

Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality

Apr 07, 2022

Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks

Apr 05, 2022

Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants

Dec 16, 2021

Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-based Hate

Aug 31, 2021

Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking

May 21, 2021

Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation

Apr 18, 2021

Dynabench: Rethinking Benchmarking in NLP

Apr 07, 2021

Rover Relocalization for Mars Sample Return by Virtual Template Synthesis and Matching

Mar 05, 2021

Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection

Dec 31, 2020

Investigating Novel Verb Learning in BERT: Selectional Preference Classes and Alternation-Based Syntactic Generalization

Nov 04, 2020