Picture for Germán Kruszewski

Germán Kruszewski

Shammie

FaST: Feature-aware Sampling and Tuning for Personalized Preference Alignment with Limited Data

Add code
Aug 06, 2025
Viaarxiv icon

Compositional preference models for aligning LMs

Add code
Oct 17, 2023
Figure 1 for Compositional preference models for aligning LMs
Figure 2 for Compositional preference models for aligning LMs
Figure 3 for Compositional preference models for aligning LMs
Figure 4 for Compositional preference models for aligning LMs
Viaarxiv icon

Should you marginalize over possible tokenizations?

Add code
Jun 30, 2023
Viaarxiv icon

disco: a toolkit for Distributional Control of Generative Models

Add code
Mar 08, 2023
Figure 1 for disco: a toolkit for Distributional Control of Generative Models
Figure 2 for disco: a toolkit for Distributional Control of Generative Models
Figure 3 for disco: a toolkit for Distributional Control of Generative Models
Figure 4 for disco: a toolkit for Distributional Control of Generative Models
Viaarxiv icon

Aligning Language Models with Preferences through f-divergence Minimization

Add code
Feb 16, 2023
Figure 1 for Aligning Language Models with Preferences through f-divergence Minimization
Figure 2 for Aligning Language Models with Preferences through f-divergence Minimization
Figure 3 for Aligning Language Models with Preferences through f-divergence Minimization
Figure 4 for Aligning Language Models with Preferences through f-divergence Minimization
Viaarxiv icon

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Add code
Jun 10, 2022
Viaarxiv icon

On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting

Add code
Jun 01, 2022
Figure 1 for On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting
Figure 2 for On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting
Figure 3 for On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting
Figure 4 for On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting
Viaarxiv icon

Sampling from Discrete Energy-Based Models with Quality/Efficiency Trade-offs

Add code
Dec 10, 2021
Figure 1 for Sampling from Discrete Energy-Based Models with Quality/Efficiency Trade-offs
Figure 2 for Sampling from Discrete Energy-Based Models with Quality/Efficiency Trade-offs
Figure 3 for Sampling from Discrete Energy-Based Models with Quality/Efficiency Trade-offs
Figure 4 for Sampling from Discrete Energy-Based Models with Quality/Efficiency Trade-offs
Viaarxiv icon

Unsupervised and Distributional Detection of Machine-Generated Text

Add code
Nov 04, 2021
Figure 1 for Unsupervised and Distributional Detection of Machine-Generated Text
Figure 2 for Unsupervised and Distributional Detection of Machine-Generated Text
Figure 3 for Unsupervised and Distributional Detection of Machine-Generated Text
Figure 4 for Unsupervised and Distributional Detection of Machine-Generated Text
Viaarxiv icon