Mor Geva

Shammie

From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty

Jul 08, 2024

Estimating Knowledge in Large Language Models Without Generating a Single Token

Jun 18, 2024

From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP

Jun 18, 2024

Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries

Jun 18, 2024

Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces

Jun 17, 2024

Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words?

May 27, 2024

RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations

Feb 27, 2024

Do Large Language Models Latently Perform Multi-Hop Reasoning?

Feb 26, 2024

The Hidden Space of Transformer Language Adapters

Feb 20, 2024

Backward Lens: Projecting Language Model Gradients into the Vocabulary Space

Feb 20, 2024