Picture for Daniil Orel

Daniil Orel

Michael Pokorny

Why Don't You Know? Evaluating the Impact of Uncertainty Sources on Uncertainty Quantification in LLMs

Add code
Apr 12, 2026
Viaarxiv icon

AICD Bench: A Challenging Benchmark for AI-Generated Code Detection

Add code
Feb 02, 2026
Viaarxiv icon

A Parallel Cross-Lingual Benchmark for Multimodal Idiomaticity Understanding

Add code
Jan 13, 2026
Viaarxiv icon

FRaN-X: FRaming and Narratives-eXplorer

Add code
Jul 09, 2025
Viaarxiv icon

CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation

Add code
May 30, 2025
Figure 1 for CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation
Figure 2 for CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation
Figure 3 for CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation
Figure 4 for CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation
Viaarxiv icon

LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMs

Add code
May 17, 2025
Figure 1 for LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMs
Figure 2 for LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMs
Figure 3 for LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMs
Figure 4 for LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMs
Viaarxiv icon

CoDet-M4: Detecting Machine-Generated Code in Multi-Lingual, Multi-Generator and Multi-Domain Settings

Add code
Mar 17, 2025
Figure 1 for CoDet-M4: Detecting Machine-Generated Code in Multi-Lingual, Multi-Generator and Multi-Domain Settings
Figure 2 for CoDet-M4: Detecting Machine-Generated Code in Multi-Lingual, Multi-Generator and Multi-Domain Settings
Figure 3 for CoDet-M4: Detecting Machine-Generated Code in Multi-Lingual, Multi-Generator and Multi-Domain Settings
Figure 4 for CoDet-M4: Detecting Machine-Generated Code in Multi-Lingual, Multi-Generator and Multi-Domain Settings
Viaarxiv icon

Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh

Add code
Mar 03, 2025
Viaarxiv icon

Qorgau: Evaluating LLM Safety in Kazakh-Russian Bilingual Contexts

Add code
Feb 19, 2025
Viaarxiv icon

Instruction Tuning on Public Government and Cultural Data for Low-Resource Language: a Case Study in Kazakh

Add code
Feb 19, 2025
Viaarxiv icon