Tanguy Urvoy

FT R&D

Few-Shot Structured Policy Learning for Multi-Domain and Multi-Task Dialogues

Feb 22, 2023
Thibault Cordier, Tanguy Urvoy, Fabrice Lefèvre, Lina M. Rojas-Barahona

Graph Neural Network Policies and Imitation Learning for Multi-Domain Task-Oriented Dialogues

Oct 11, 2022
Thibault Cordier, Tanguy Urvoy, Fabrice Lefèvre, Lina M. Rojas-Barahona

Denoising Pre-Training and Data Augmentation Strategies for Enhanced RDF Verbalization with Transformers

Dec 01, 2020
Sebastien Montella, Betty Fabre, Tanguy Urvoy, Johannes Heinecke, Lina Rojas-Barahona

Diluted Near-Optimal Expert Demonstrations for Guiding Dialogue Stochastic Policy Optimisation

Nov 25, 2020
Thibault Cordier, Tanguy Urvoy, Lina M. Rojas-Barahona, Fabrice Lefèvre

Scaling up budgeted reinforcement learning

Mar 06, 2019
Nicolas Carrara, Edouard Leurent, Romain Laroche, Tanguy Urvoy, Odalric-Ambrym Maillard, Olivier Pietquin

Corrupt Bandits for Preserving Local Privacy

Nov 02, 2017
Pratik Gajane, Tanguy Urvoy, Emilie Kaufmann

Random Forest for the Contextual Bandit Problem - extended version

Sep 15, 2016
Raphaël Féraud, Robin Allesiardo, Tanguy Urvoy, Fabrice Clérot

Bandit Structured Prediction for Learning from Partial Feedback in Statistical Machine Translation

Jan 18, 2016
Artem Sokolov, Stefan Riezler, Tanguy Urvoy

A Relative Exponential Weighing Algorithm for Adversarial Utility-based Dueling Bandits

Jan 15, 2016
Pratik Gajane, Tanguy Urvoy, Fabrice Clérot

Utility-based Dueling Bandits as a Partial Monitoring Game

Sep 25, 2015
Pratik Gajane, Tanguy Urvoy
