Picture for Bhavdeep Sachdeva

Bhavdeep Sachdeva

Real-Time Visual Feedback to Guide Benchmark Creation: A Human-and-Metric-in-the-Loop Workflow

Add code
Feb 09, 2023
Viaarxiv icon

NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks

Add code
Apr 12, 2022
Figure 1 for NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks
Figure 2 for NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks
Figure 3 for NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks
Figure 4 for NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks
Viaarxiv icon

DQI: A Guide to Benchmark Evaluation

Add code
Aug 10, 2020
Figure 1 for DQI: A Guide to Benchmark Evaluation
Figure 2 for DQI: A Guide to Benchmark Evaluation
Figure 3 for DQI: A Guide to Benchmark Evaluation
Figure 4 for DQI: A Guide to Benchmark Evaluation
Viaarxiv icon

Towards Question Format Independent Numerical Reasoning: A Set of Prerequisite Tasks

Add code
May 18, 2020
Figure 1 for Towards Question Format Independent Numerical Reasoning: A Set of Prerequisite Tasks
Figure 2 for Towards Question Format Independent Numerical Reasoning: A Set of Prerequisite Tasks
Figure 3 for Towards Question Format Independent Numerical Reasoning: A Set of Prerequisite Tasks
Figure 4 for Towards Question Format Independent Numerical Reasoning: A Set of Prerequisite Tasks
Viaarxiv icon

DQI: Measuring Data Quality in NLP

Add code
May 02, 2020
Figure 1 for DQI: Measuring Data Quality in NLP
Figure 2 for DQI: Measuring Data Quality in NLP
Figure 3 for DQI: Measuring Data Quality in NLP
Figure 4 for DQI: Measuring Data Quality in NLP
Viaarxiv icon