Preslav Nakov

Mohamed bin Zayed University of Artificial Intelligence

Can a Multichoice Dataset be Repurposed for Extractive Question Answering?

Apr 26, 2024

SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection

Apr 22, 2024

EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models

Mar 15, 2024

Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification

Mar 07, 2024

Multimodal Large Language Models to Support Real-World Fact-Checking

Mar 06, 2024

ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic

Feb 20, 2024

A Chinese Dataset for Evaluating the Safeguards in Large Language Models

Feb 19, 2024

M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection

Feb 17, 2024

Factuality of Large Language Models in the Year 2024

Feb 09, 2024

Generating Unsupervised Abstractive Explanations for Rumour Verification

Jan 23, 2024