Picture for Sheikh Shafayat

Sheikh Shafayat

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

Add code
Jun 09, 2024
Figure 1 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Figure 2 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Figure 3 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Figure 4 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Viaarxiv icon

BEnQA: A Question Answering and Reasoning Benchmark for Bengali and English

Add code
Mar 16, 2024
Figure 1 for BEnQA: A Question Answering and Reasoning Benchmark for Bengali and English
Figure 2 for BEnQA: A Question Answering and Reasoning Benchmark for Bengali and English
Figure 3 for BEnQA: A Question Answering and Reasoning Benchmark for Bengali and English
Figure 4 for BEnQA: A Question Answering and Reasoning Benchmark for Bengali and English
Viaarxiv icon

Multi-FAct: Assessing Multilingual LLMs' Multi-Regional Knowledge using FActScore

Add code
Mar 01, 2024
Viaarxiv icon

LangBridge: Multilingual Reasoning Without Multilingual Supervision

Add code
Jan 19, 2024
Figure 1 for LangBridge: Multilingual Reasoning Without Multilingual Supervision
Figure 2 for LangBridge: Multilingual Reasoning Without Multilingual Supervision
Figure 3 for LangBridge: Multilingual Reasoning Without Multilingual Supervision
Figure 4 for LangBridge: Multilingual Reasoning Without Multilingual Supervision
Viaarxiv icon