Lucas Dixon

Decoding-time Realignment of Language Models

Feb 05, 2024
Via arXiv

Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models

Jan 12, 2024
Via arXiv

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023
Via arXiv

Interpretability Illusions in the Generalization of Simplified Models

Dec 06, 2023
Via arXiv

AI Alignment in the Design of Interactive AI: Specification Alignment, Process Alignment, and Evaluation Support

Oct 23, 2023
Via arXiv

Large Language Models are Competitive Near Cold-start Recommenders for Language- and Item-based Preferences

Jul 26, 2023
Via arXiv

Large Language Models for User Interest Journeys

May 24, 2023
Via arXiv

Simfluence: Modeling the Influence of Individual Training Examples by Simulating Training Runs

Mar 14, 2023
Via arXiv

Towards Agile Text Classifiers for Everyone

Feb 13, 2023
Via arXiv

Gradient-Based Automated Iterative Recovery for Parameter-Efficient Tuning

Feb 13, 2023
Via arXiv