Roman Vashurin

Mohamed bin Zayed University of Artificial Intelligence

Why Don't You Know? Evaluating the Impact of Uncertainty Sources on Uncertainty Quantification in LLMs

Apr 12, 2026
Via arXiv

ReDAct: Uncertainty-Aware Deferral for LLM Agents

Apr 08, 2026
Via arXiv

Don't Throw Away Your Beams: Improving Consistency-based Uncertainties in LLMs via Beam Search

Dec 10, 2025
Via arXiv

Faithfulness-Aware Uncertainty Quantification for Fact-Checking the Output of Retrieval Augmented Generation

May 28, 2025
Via arXiv

UNCERTAINTY-LINE: Length-Invariant Estimation of Uncertainty for Large Language Models

May 25, 2025
Via arXiv

CoCoA: A Generalized Approach to Uncertainty Quantification by Integrating Confidence and Consistency of LLM Outputs

Feb 07, 2025
Via arXiv

Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph

Jun 21, 2024
Via arXiv

LM-Polygraph: Uncertainty Estimation for Language Models

Nov 13, 2023
Via arXiv

Embedded Ensembles: Infinite Width Limit and Operating Regimes

Feb 24, 2022
Via arXiv