Topic


MetaRAG: Metamorphic Testing for Hallucination Detection in RAG Systems

Add code
Sep 11, 2025
Viaarxiv icon

"A 6 or a 9?": Ensemble Learning Through the Multiplicity of Performant Models and Explanations

Add code
Sep 11, 2025
Viaarxiv icon

Short-term cognitive fatigue of spatial selective attention after face-to-face conversations in virtual noisy environments

Add code
Sep 11, 2025
Viaarxiv icon

DischargeSim: A Simulation Benchmark for Educational Doctor-Patient Communication at Discharge

Add code
Sep 10, 2025
Viaarxiv icon

Heart Disease Prediction: A Comparative Study of Optimisers Performance in Deep Neural Networks

Add code
Sep 10, 2025
Figure 1 for Heart Disease Prediction: A Comparative Study of Optimisers Performance in Deep Neural Networks
Figure 2 for Heart Disease Prediction: A Comparative Study of Optimisers Performance in Deep Neural Networks
Figure 3 for Heart Disease Prediction: A Comparative Study of Optimisers Performance in Deep Neural Networks
Figure 4 for Heart Disease Prediction: A Comparative Study of Optimisers Performance in Deep Neural Networks
Viaarxiv icon

Probing the Preferences of a Language Model: Integrating Verbal and Behavioral Tests of AI Welfare

Add code
Sep 09, 2025
Viaarxiv icon

Getting In Contract with Large Language Models -- An Agency Theory Perspective On Large Language Model Alignment

Add code
Sep 09, 2025
Viaarxiv icon

Two Stage Context Learning with Large Language Models for Multimodal Stance Detection on Climate Change

Add code
Sep 09, 2025
Viaarxiv icon

Visual-TableQA: Open-Domain Benchmark for Reasoning over Table Images

Add code
Sep 09, 2025
Viaarxiv icon

Biased Tales: Cultural and Topic Bias in Generating Children's Stories

Add code
Sep 09, 2025
Viaarxiv icon