Marc Dymetman

Compositional preference models for aligning LMs

Oct 17, 2023
Dongyoung Go, Tomasz Korbak, Germán Kruszewski, Jos Rozen, Marc Dymetman

Should you marginalize over possible tokenizations?

Jun 30, 2023
Nadezhda Chirkova, Germán Kruszewski, Jos Rozen, Marc Dymetman

disco: a toolkit for Distributional Control of Generative Models

Mar 08, 2023
Germán Kruszewski, Jos Rozen, Marc Dymetman

Aligning Language Models with Preferences through f-divergence Minimization

Feb 16, 2023
Dongyoung Go, Tomasz Korbak, Germán Kruszewski, Jos Rozen, Nahyeon Ryu, Marc Dymetman

On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting

Jun 01, 2022
Tomasz Korbak, Hady Elsahar, Germán Kruszewski, Marc Dymetman

Sampling from Discrete Energy-Based Models with Quality/Efficiency Trade-offs

Dec 10, 2021
Bryan Eikema, Germán Kruszewski, Hady Elsahar, Marc Dymetman

Controlling Conditional Language Models with Distributional Policy Gradients

Dec 01, 2021
Tomasz Korbak, Hady Elsahar, Germán Kruszewski, Marc Dymetman

Energy-Based Models for Code Generation under Compilability Constraints

Jun 09, 2021
Tomasz Korbak, Hady Elsahar, Marc Dymetman, Germán Kruszewski

A Distributional Approach to Controlled Text Generation

Dec 21, 2020
Muhammad Khalifa, Hady Elsahar, Marc Dymetman

Distributional Reinforcement Learning for Energy-Based Sequential Models

Dec 18, 2019
Tetiana Parshakova, Jean-Marc Andreoli, Marc Dymetman
