Tathagata Raha

Cross-Examination Framework: A Task-Agnostic Diagnostic for Information Fidelity in Text-to-Text Generation

Jan 27, 2026
Via arXiv

Overalignment in Frontier LLMs: An Empirical Study of Sycophantic Behaviour in Healthcare

Jan 26, 2026
Via arXiv

Named Clinical Entity Recognition Benchmark

Oct 07, 2024
Via arXiv

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Sep 11, 2024
Via arXiv

Med42-v2: A Suite of Clinical LLMs

Aug 12, 2024
Via arXiv

Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks

Jul 29, 2024
Via arXiv

iREL at SemEval-2024 Task 9: Improving Conventional Prompting Methods for Brain Teasers

May 25, 2024
Via arXiv

Med42 -- Evaluating Fine-Tuning Strategies for Medical LLMs: Full-Parameter vs. Parameter-Efficient Approaches

Apr 23, 2024
Via arXiv

Neural models for Factual Inconsistency Classification with Explanations

Jun 15, 2023
Via arXiv

Identifying COVID-19 Fake News in Social Media

Feb 01, 2021
Via arXiv