Picture for Werner Geyer

Werner Geyer

Hide or Highlight: Understanding the Impact of Factuality Expression on User Trust

Add code
Aug 09, 2025
Viaarxiv icon

Highlight All the Phrases: Enhancing LLM Transparency through Visual Factuality Indicators

Add code
Aug 09, 2025
Viaarxiv icon

A Case Study Investigating the Role of Generative AI in Quality Evaluations of Epics in Agile Software Development

Add code
May 12, 2025
Viaarxiv icon

"The Diagram is like Guardrails": Structuring GenAI-assisted Hypotheses Exploration with an Interactive Shared Representation

Add code
Mar 21, 2025
Figure 1 for "The Diagram is like Guardrails": Structuring GenAI-assisted Hypotheses Exploration with an Interactive Shared Representation
Figure 2 for "The Diagram is like Guardrails": Structuring GenAI-assisted Hypotheses Exploration with an Interactive Shared Representation
Figure 3 for "The Diagram is like Guardrails": Structuring GenAI-assisted Hypotheses Exploration with an Interactive Shared Representation
Figure 4 for "The Diagram is like Guardrails": Structuring GenAI-assisted Hypotheses Exploration with an Interactive Shared Representation
Viaarxiv icon

NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning

Add code
Dec 20, 2024
Figure 1 for NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning
Figure 2 for NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning
Figure 3 for NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning
Figure 4 for NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning
Viaarxiv icon

Granite Guardian

Add code
Dec 10, 2024
Figure 1 for Granite Guardian
Figure 2 for Granite Guardian
Figure 3 for Granite Guardian
Figure 4 for Granite Guardian
Viaarxiv icon

LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs

Add code
Oct 18, 2024
Figure 1 for LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs
Figure 2 for LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs
Figure 3 for LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs
Figure 4 for LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs
Viaarxiv icon

Black-box Uncertainty Quantification Method for LLM-as-a-Judge

Add code
Oct 15, 2024
Figure 1 for Black-box Uncertainty Quantification Method for LLM-as-a-Judge
Figure 2 for Black-box Uncertainty Quantification Method for LLM-as-a-Judge
Figure 3 for Black-box Uncertainty Quantification Method for LLM-as-a-Judge
Figure 4 for Black-box Uncertainty Quantification Method for LLM-as-a-Judge
Viaarxiv icon

Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge

Add code
Oct 03, 2024
Figure 1 for Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge
Figure 2 for Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge
Figure 3 for Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge
Figure 4 for Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge
Viaarxiv icon

Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions

Add code
May 30, 2024
Figure 1 for Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions
Figure 2 for Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions
Figure 3 for Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions
Figure 4 for Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions
Viaarxiv icon