Picture for Runing Yang

Runing Yang

AccidentBench: Benchmarking Multimodal Understanding and Reasoning in Vehicle Accidents and Beyond

Add code
Sep 30, 2025
Viaarxiv icon

Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization?

Add code
Jun 25, 2024
Viaarxiv icon

InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States

Add code
Jun 17, 2024
Figure 1 for InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States
Figure 2 for InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States
Figure 3 for InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States
Figure 4 for InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States
Viaarxiv icon