Lucas Dixon

Who's asking? User personas and the mechanics of latent misalignment

Jun 17, 2024
Via arXiv

Interactive Prompt Debugging with Sequence Salience

Apr 11, 2024
Via arXiv

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Mar 15, 2024
Via arXiv

Gemma: Open Models Based on Gemini Research and Technology

Mar 13, 2024
Via arXiv

Detecting Hallucination and Coverage Errors in Retrieval Augmented Generation for Controversial Topics

Mar 13, 2024
Via arXiv

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024
Via arXiv

LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models

Feb 16, 2024
Via arXiv

Decoding-time Realignment of Language Models

Feb 05, 2024
Via arXiv

Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models

Jan 12, 2024
Via arXiv

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023
Via arXiv