
Vipul Gupta

Changing Answer Order Can Decrease MMLU Accuracy

Jun 27, 2024
Via arXiv

LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

Jun 25, 2024
Via arXiv

"Confidently Nonsensical?": A Critical Survey on the Perspectives and Challenges of 'Hallucinations' in NLP

Apr 11, 2024
Via arXiv

Interpretable Multi-Source Data Fusion Through Latent Variable Gaussian Process

Feb 16, 2024
Via arXiv

The Sentiment Problem: A Critical Survey towards Deconstructing Sentiment Analysis

Oct 18, 2023
Via arXiv

CALM: A Multi-task Benchmark for Comprehensive Assessment of Language Model Bias

Aug 24, 2023
Via arXiv

Semantic Consistency for Assuring Reliability of Large Language Models

Aug 17, 2023
Via arXiv

Survey on Sociodemographic Bias in Natural Language Processing

Jun 27, 2023
Via arXiv

Do we need entire training data for adversarial training?

Mar 10, 2023
Via arXiv

SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering

Apr 05, 2022
Via arXiv