Picture for Mark Gales

Mark Gales

Efficient LLM Comparative Assessment: a Product of Experts Framework for Pairwise Comparisons

Add code
May 09, 2024
Figure 1 for Efficient LLM Comparative Assessment: a Product of Experts Framework for Pairwise Comparisons
Figure 2 for Efficient LLM Comparative Assessment: a Product of Experts Framework for Pairwise Comparisons
Figure 3 for Efficient LLM Comparative Assessment: a Product of Experts Framework for Pairwise Comparisons
Figure 4 for Efficient LLM Comparative Assessment: a Product of Experts Framework for Pairwise Comparisons
Viaarxiv icon

Question Difficulty Ranking for Multiple-Choice Reading Comprehension

Add code
Apr 16, 2024
Viaarxiv icon

LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History

Add code
Feb 28, 2024
Figure 1 for LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History
Figure 2 for LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History
Figure 3 for LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History
Figure 4 for LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History
Viaarxiv icon

Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment

Add code
Feb 21, 2024
Figure 1 for Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment
Figure 2 for Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment
Figure 3 for Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment
Figure 4 for Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment
Viaarxiv icon

An Information-Theoretic Approach to Analyze NLP Classification Tasks

Add code
Feb 01, 2024
Viaarxiv icon

Structural-Based Uncertainty in Deep Learning Across Anatomical Scales: Analysis in White Matter Lesion Segmentation

Add code
Nov 15, 2023
Figure 1 for Structural-Based Uncertainty in Deep Learning Across Anatomical Scales: Analysis in White Matter Lesion Segmentation
Figure 2 for Structural-Based Uncertainty in Deep Learning Across Anatomical Scales: Analysis in White Matter Lesion Segmentation
Figure 3 for Structural-Based Uncertainty in Deep Learning Across Anatomical Scales: Analysis in White Matter Lesion Segmentation
Figure 4 for Structural-Based Uncertainty in Deep Learning Across Anatomical Scales: Analysis in White Matter Lesion Segmentation
Viaarxiv icon

Assessing Distractors in Multiple-Choice Tests

Add code
Nov 08, 2023
Figure 1 for Assessing Distractors in Multiple-Choice Tests
Figure 2 for Assessing Distractors in Multiple-Choice Tests
Figure 3 for Assessing Distractors in Multiple-Choice Tests
Figure 4 for Assessing Distractors in Multiple-Choice Tests
Viaarxiv icon

Is it Possible to Modify Text to a Target Readability Level? An Initial Investigation Using Zero-Shot Large Language Models

Add code
Sep 22, 2023
Viaarxiv icon

Minimum Bayes' Risk Decoding for System Combination of Grammatical Error Correction Systems

Add code
Sep 12, 2023
Viaarxiv icon

Can Generative Large Language Models Perform ASR Error Correction?

Add code
Jul 09, 2023
Viaarxiv icon