Picture for Meher Mankikar

Meher Mankikar

MultiNRC: A Challenging and Native Multilingual Reasoning Evaluation Benchmark for LLMs

Add code
Jul 23, 2025
Figure 1 for MultiNRC: A Challenging and Native Multilingual Reasoning Evaluation Benchmark for LLMs
Figure 2 for MultiNRC: A Challenging and Native Multilingual Reasoning Evaluation Benchmark for LLMs
Figure 3 for MultiNRC: A Challenging and Native Multilingual Reasoning Evaluation Benchmark for LLMs
Figure 4 for MultiNRC: A Challenging and Native Multilingual Reasoning Evaluation Benchmark for LLMs
Viaarxiv icon

FORTRESS: Frontier Risk Evaluation for National Security and Public Safety

Add code
Jun 17, 2025
Viaarxiv icon