Alert button
Picture for Preslav Nakov

Preslav Nakov

Alert button

EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models

Mar 15, 2024
Rocktim Jyoti Das, Simeon Emilov Hristov, Haonan Li, Dimitar Iliyanov Dimitrov, Ivan Koychev, Preslav Nakov

Viaarxiv icon

Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification

Mar 07, 2024
Ekaterina Fadeeva, Aleksandr Rubashevskii, Artem Shelmanov, Sergey Petrakov, Haonan Li, Hamdy Mubarak, Evgenii Tsymbalov, Gleb Kuzmin, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov

Figure 1 for Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification
Figure 2 for Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification
Figure 3 for Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification
Figure 4 for Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification
Viaarxiv icon

Multimodal Large Language Models to Support Real-World Fact-Checking

Mar 06, 2024
Jiahui Geng, Yova Kementchedjhieva, Preslav Nakov, Iryna Gurevych

Figure 1 for Multimodal Large Language Models to Support Real-World Fact-Checking
Figure 2 for Multimodal Large Language Models to Support Real-World Fact-Checking
Figure 3 for Multimodal Large Language Models to Support Real-World Fact-Checking
Figure 4 for Multimodal Large Language Models to Support Real-World Fact-Checking
Viaarxiv icon

ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic

Feb 20, 2024
Fajri Koto, Haonan Li, Sara Shatnawi, Jad Doughman, Abdelrahman Boda Sadallah, Aisha Alraeesi, Khalid Almubarak, Zaid Alyafeai, Neha Sengupta, Shady Shehata, Nizar Habash, Preslav Nakov, Timothy Baldwin

Viaarxiv icon

A Chinese Dataset for Evaluating the Safeguards in Large Language Models

Feb 19, 2024
Yuxia Wang, Zenan Zhai, Haonan Li, Xudong Han, Lizhi Lin, Zhenxuan Zhang, Jingru Zhao, Preslav Nakov, Timothy Baldwin

Viaarxiv icon

M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection

Feb 17, 2024
Yuxia Wang, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Osama Mohanned Afzal, Tarek Mahmoud, Giovanni Puccetti, Thomas Arnold, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov

Viaarxiv icon

Factuality of Large Language Models in the Year 2024

Feb 09, 2024
Yuxia Wang, Minghan Wang, Muhammad Arslan Manzoor, Fei Liu, Georgi Georgiev, Rocktim Jyoti Das, Preslav Nakov

Viaarxiv icon

Recent Advances in Hate Speech Moderation: Multimodality and the Role of Large Models

Jan 30, 2024
Ming Shan Hee, Shivam Sharma, Rui Cao, Palash Nandi, Preslav Nakov, Tanmoy Chakraborty, Roy Ka-Wei Lee

Viaarxiv icon

Generating Unsupervised Abstractive Explanations for Rumour Verification

Jan 23, 2024
Iman Munire Bilal, Preslav Nakov, Rob Procter, Maria Liakata

Viaarxiv icon