Picture for Gagan Bhatia

Gagan Bhatia

Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation

Add code
Apr 06, 2026
Viaarxiv icon

What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?

Add code
Mar 19, 2026
Viaarxiv icon

Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA

Add code
Mar 10, 2026
Viaarxiv icon

From RAG to Agentic RAG for Faithful Islamic Question Answering

Add code
Jan 12, 2026
Viaarxiv icon

Prototypicality Bias Reveals Blindspots in Multimodal Evaluation Metrics

Add code
Jan 08, 2026
Viaarxiv icon

Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images

Add code
Jun 16, 2025
Figure 1 for Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images
Figure 2 for Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images
Figure 3 for Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images
Figure 4 for Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images
Viaarxiv icon

Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning

Add code
May 22, 2025
Figure 1 for Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning
Figure 2 for Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning
Figure 3 for Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning
Figure 4 for Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning
Viaarxiv icon

DateLogicQA: Benchmarking Temporal Biases in Large Language Models

Add code
Dec 17, 2024
Figure 1 for DateLogicQA: Benchmarking Temporal Biases in Large Language Models
Figure 2 for DateLogicQA: Benchmarking Temporal Biases in Large Language Models
Figure 3 for DateLogicQA: Benchmarking Temporal Biases in Large Language Models
Figure 4 for DateLogicQA: Benchmarking Temporal Biases in Large Language Models
Viaarxiv icon

Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks

Add code
Nov 02, 2024
Figure 1 for Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks
Figure 2 for Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks
Figure 3 for Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks
Figure 4 for Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks
Viaarxiv icon

Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic

Add code
Jul 26, 2024
Figure 1 for Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic
Figure 2 for Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic
Figure 3 for Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic
Figure 4 for Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic
Viaarxiv icon