Picture for Denis Janiak

Denis Janiak

Rethinking the Evaluation of Alignment Methods: Insights into Diversity, Generalisation, and Safety

Add code
Sep 16, 2025
Viaarxiv icon

FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs

Add code
Mar 21, 2025
Figure 1 for FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs
Figure 2 for FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs
Figure 3 for FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs
Figure 4 for FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs
Viaarxiv icon

Hallucination Detection in LLMs Using Spectral Features of Attention Maps

Add code
Feb 24, 2025
Figure 1 for Hallucination Detection in LLMs Using Spectral Features of Attention Maps
Figure 2 for Hallucination Detection in LLMs Using Spectral Features of Attention Maps
Figure 3 for Hallucination Detection in LLMs Using Spectral Features of Attention Maps
Figure 4 for Hallucination Detection in LLMs Using Spectral Features of Attention Maps
Viaarxiv icon

Unveiling the Potential of Probabilistic Embeddings in Self-Supervised Learning

Add code
Oct 27, 2023
Figure 1 for Unveiling the Potential of Probabilistic Embeddings in Self-Supervised Learning
Figure 2 for Unveiling the Potential of Probabilistic Embeddings in Self-Supervised Learning
Figure 3 for Unveiling the Potential of Probabilistic Embeddings in Self-Supervised Learning
Figure 4 for Unveiling the Potential of Probabilistic Embeddings in Self-Supervised Learning
Viaarxiv icon

Graph-level representations using ensemble-based readout functions

Add code
Mar 03, 2023
Figure 1 for Graph-level representations using ensemble-based readout functions
Figure 2 for Graph-level representations using ensemble-based readout functions
Figure 3 for Graph-level representations using ensemble-based readout functions
Figure 4 for Graph-level representations using ensemble-based readout functions
Viaarxiv icon

This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish

Add code
Nov 23, 2022
Viaarxiv icon