Alert button
Picture for Alessandro Sordoni

Alessandro Sordoni

Alert button

V-STaR: Training Verifiers for Self-Taught Reasoners

Add code
Bookmark button
Alert button
Feb 09, 2024
Arian Hosseini, Xingdi Yuan, Nikolay Malkin, Aaron Courville, Alessandro Sordoni, Rishabh Agarwal

Viaarxiv icon

Guiding Language Model Reasoning with Planning Tokens

Add code
Bookmark button
Alert button
Oct 09, 2023
Xinyi Wang, Lucas Caccia, Oleksiy Ostapenko, Xingdi Yuan, Alessandro Sordoni

Figure 1 for Guiding Language Model Reasoning with Planning Tokens
Figure 2 for Guiding Language Model Reasoning with Planning Tokens
Figure 3 for Guiding Language Model Reasoning with Planning Tokens
Figure 4 for Guiding Language Model Reasoning with Planning Tokens
Viaarxiv icon

Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference

Add code
Bookmark button
Alert button
Jun 21, 2023
Alessandro Sordoni, Xingdi Yuan, Marc-Alexandre Côté, Matheus Pereira, Adam Trischler, Ziang Xiao, Arian Hosseini, Friederike Niedtner, Nicolas Le Roux

Viaarxiv icon

On the Compositional Generalization Gap of In-Context Learning

Add code
Bookmark button
Alert button
Nov 15, 2022
Arian Hosseini, Ankit Vani, Dzmitry Bahdanau, Alessandro Sordoni, Aaron Courville

Figure 1 for On the Compositional Generalization Gap of In-Context Learning
Figure 2 for On the Compositional Generalization Gap of In-Context Learning
Figure 3 for On the Compositional Generalization Gap of In-Context Learning
Figure 4 for On the Compositional Generalization Gap of In-Context Learning
Viaarxiv icon

Multi-Head Adapter Routing for Data-Efficient Fine-Tuning

Add code
Bookmark button
Alert button
Nov 07, 2022
Lucas Caccia, Edoardo Ponti, Lucas Liu, Matheus Pereira, Nicolas Le Roux, Alessandro Sordoni

Figure 1 for Multi-Head Adapter Routing for Data-Efficient Fine-Tuning
Figure 2 for Multi-Head Adapter Routing for Data-Efficient Fine-Tuning
Figure 3 for Multi-Head Adapter Routing for Data-Efficient Fine-Tuning
Figure 4 for Multi-Head Adapter Routing for Data-Efficient Fine-Tuning
Viaarxiv icon

Expressiveness and Learnability: A Unifying View for Evaluating Self-Supervised Learning

Add code
Bookmark button
Alert button
Jun 02, 2022
Yuchen Lu, Zhen Liu, Aristide Baratin, Romain Laroche, Aaron Courville, Alessandro Sordoni

Figure 1 for Expressiveness and Learnability: A Unifying View for Evaluating Self-Supervised Learning
Figure 2 for Expressiveness and Learnability: A Unifying View for Evaluating Self-Supervised Learning
Figure 3 for Expressiveness and Learnability: A Unifying View for Evaluating Self-Supervised Learning
Figure 4 for Expressiveness and Learnability: A Unifying View for Evaluating Self-Supervised Learning
Viaarxiv icon

Evaluating Distributional Distortion in Neural Language Modeling

Add code
Bookmark button
Alert button
Mar 24, 2022
Benjamin LeBrun, Alessandro Sordoni, Timothy J. O'Donnell

Figure 1 for Evaluating Distributional Distortion in Neural Language Modeling
Figure 2 for Evaluating Distributional Distortion in Neural Language Modeling
Figure 3 for Evaluating Distributional Distortion in Neural Language Modeling
Figure 4 for Evaluating Distributional Distortion in Neural Language Modeling
Viaarxiv icon

Better Language Model with Hypernym Class Prediction

Add code
Bookmark button
Alert button
Mar 21, 2022
He Bai, Tong Wang, Alessandro Sordoni, Peng Shi

Figure 1 for Better Language Model with Hypernym Class Prediction
Figure 2 for Better Language Model with Hypernym Class Prediction
Figure 3 for Better Language Model with Hypernym Class Prediction
Figure 4 for Better Language Model with Hypernym Class Prediction
Viaarxiv icon

Combining Modular Skills in Multitask Learning

Add code
Bookmark button
Alert button
Mar 01, 2022
Edoardo M. Ponti, Alessandro Sordoni, Yoshua Bengio, Siva Reddy

Figure 1 for Combining Modular Skills in Multitask Learning
Figure 2 for Combining Modular Skills in Multitask Learning
Figure 3 for Combining Modular Skills in Multitask Learning
Figure 4 for Combining Modular Skills in Multitask Learning
Viaarxiv icon

Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge

Add code
Bookmark button
Alert button
Dec 16, 2021
Ian Porada, Alessandro Sordoni, Jackie Chi Kit Cheung

Figure 1 for Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge
Figure 2 for Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge
Figure 3 for Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge
Viaarxiv icon