Picture for Negar Foroutan

Negar Foroutan

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Add code
Jun 26, 2025
Viaarxiv icon

WikiMixQA: A Multimodal Benchmark for Question Answering over Tables and Charts

Add code
Jun 18, 2025
Viaarxiv icon

ConLID: Supervised Contrastive Learning for Low-Resource Language Identification

Add code
Jun 18, 2025
Viaarxiv icon

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Add code
Nov 29, 2024
Figure 1 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Figure 2 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Figure 3 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Figure 4 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Viaarxiv icon

How Do Multilingual Models Remember? Investigating Multilingual Factual Recall Mechanisms

Add code
Oct 18, 2024
Viaarxiv icon

Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants

Add code
Aug 07, 2024
Figure 1 for Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants
Figure 2 for Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants
Figure 3 for Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants
Figure 4 for Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants
Viaarxiv icon

Breaking the Language Barrier: Improving Cross-Lingual Reasoning with Structured Self-Attention

Add code
Oct 23, 2023
Viaarxiv icon

Discovering Knowledge-Critical Subnetworks in Pretrained Language Models

Add code
Oct 04, 2023
Viaarxiv icon

Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages

Add code
Jun 29, 2023
Figure 1 for Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages
Figure 2 for Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages
Figure 3 for Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages
Figure 4 for Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages
Viaarxiv icon

Discovering Language-neutral Sub-networks in Multilingual Language Models

Add code
May 25, 2022
Figure 1 for Discovering Language-neutral Sub-networks in Multilingual Language Models
Figure 2 for Discovering Language-neutral Sub-networks in Multilingual Language Models
Figure 3 for Discovering Language-neutral Sub-networks in Multilingual Language Models
Figure 4 for Discovering Language-neutral Sub-networks in Multilingual Language Models
Viaarxiv icon