Picture for Ronan Le Bras

Ronan Le Bras

WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

Add code
Jun 07, 2024
Viaarxiv icon

NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation

Add code
Dec 10, 2023
Figure 1 for NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation
Figure 2 for NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation
Figure 3 for NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation
Figure 4 for NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation
Viaarxiv icon

MacGyver: Are Large Language Models Creative Problem Solvers?

Add code
Nov 16, 2023
Figure 1 for MacGyver: Are Large Language Models Creative Problem Solvers?
Figure 2 for MacGyver: Are Large Language Models Creative Problem Solvers?
Figure 3 for MacGyver: Are Large Language Models Creative Problem Solvers?
Figure 4 for MacGyver: Are Large Language Models Creative Problem Solvers?
Viaarxiv icon

FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions

Add code
Oct 31, 2023
Figure 1 for FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions
Figure 2 for FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions
Figure 3 for FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions
Figure 4 for FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions
Viaarxiv icon

Commonsense Knowledge Transfer for Pre-trained Language Models

Add code
Jun 04, 2023
Figure 1 for Commonsense Knowledge Transfer for Pre-trained Language Models
Figure 2 for Commonsense Knowledge Transfer for Pre-trained Language Models
Figure 3 for Commonsense Knowledge Transfer for Pre-trained Language Models
Figure 4 for Commonsense Knowledge Transfer for Pre-trained Language Models
Viaarxiv icon

Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference

Add code
Jun 04, 2023
Figure 1 for Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference
Figure 2 for Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference
Figure 3 for Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference
Figure 4 for Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference
Viaarxiv icon

NLPositionality: Characterizing Design Biases of Datasets and Models

Add code
Jun 02, 2023
Figure 1 for NLPositionality: Characterizing Design Biases of Datasets and Models
Figure 2 for NLPositionality: Characterizing Design Biases of Datasets and Models
Figure 3 for NLPositionality: Characterizing Design Biases of Datasets and Models
Figure 4 for NLPositionality: Characterizing Design Biases of Datasets and Models
Viaarxiv icon

Faith and Fate: Limits of Transformers on Compositionality

Add code
Jun 01, 2023
Figure 1 for Faith and Fate: Limits of Transformers on Compositionality
Figure 2 for Faith and Fate: Limits of Transformers on Compositionality
Figure 3 for Faith and Fate: Limits of Transformers on Compositionality
Figure 4 for Faith and Fate: Limits of Transformers on Compositionality
Viaarxiv icon

From Dogwhistles to Bullhorns: Unveiling Coded Rhetoric with Language Models

Add code
May 26, 2023
Figure 1 for From Dogwhistles to Bullhorns: Unveiling Coded Rhetoric with Language Models
Figure 2 for From Dogwhistles to Bullhorns: Unveiling Coded Rhetoric with Language Models
Figure 3 for From Dogwhistles to Bullhorns: Unveiling Coded Rhetoric with Language Models
Figure 4 for From Dogwhistles to Bullhorns: Unveiling Coded Rhetoric with Language Models
Viaarxiv icon

Improving Language Models with Advantage-based Offline Policy Gradients

Add code
May 24, 2023
Figure 1 for Improving Language Models with Advantage-based Offline Policy Gradients
Figure 2 for Improving Language Models with Advantage-based Offline Policy Gradients
Figure 3 for Improving Language Models with Advantage-based Offline Policy Gradients
Figure 4 for Improving Language Models with Advantage-based Offline Policy Gradients
Viaarxiv icon