Picture for Makesh Narsimhan Sreedhar

Makesh Narsimhan Sreedhar

Unsupervised Extraction of Dialogue Policies from Conversations

Add code
Jun 21, 2024
Viaarxiv icon

Nemotron-4 340B Technical Report

Add code
Jun 17, 2024
Figure 1 for Nemotron-4 340B Technical Report
Figure 2 for Nemotron-4 340B Technical Report
Figure 3 for Nemotron-4 340B Technical Report
Figure 4 for Nemotron-4 340B Technical Report
Viaarxiv icon

HelpSteer2: Open-source dataset for training top-performing reward models

Add code
Jun 12, 2024
Viaarxiv icon

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Add code
Apr 04, 2024
Viaarxiv icon

Evolving Domain Adaptation of Pretrained Language Models for Text Classification

Add code
Nov 16, 2023
Figure 1 for Evolving Domain Adaptation of Pretrained Language Models for Text Classification
Figure 2 for Evolving Domain Adaptation of Pretrained Language Models for Text Classification
Figure 3 for Evolving Domain Adaptation of Pretrained Language Models for Text Classification
Figure 4 for Evolving Domain Adaptation of Pretrained Language Models for Text Classification
Viaarxiv icon

HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM

Add code
Nov 16, 2023
Figure 1 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 2 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 3 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 4 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Viaarxiv icon

SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF

Add code
Oct 09, 2023
Figure 1 for SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF
Figure 2 for SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF
Figure 3 for SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF
Figure 4 for SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF
Viaarxiv icon

Prompt Learning for Domain Adaptation in Task-Oriented Dialogue

Add code
Nov 10, 2022
Figure 1 for Prompt Learning for Domain Adaptation in Task-Oriented Dialogue
Figure 2 for Prompt Learning for Domain Adaptation in Task-Oriented Dialogue
Figure 3 for Prompt Learning for Domain Adaptation in Task-Oriented Dialogue
Figure 4 for Prompt Learning for Domain Adaptation in Task-Oriented Dialogue
Viaarxiv icon

Local Byte Fusion for Neural Machine Translation

Add code
May 23, 2022
Figure 1 for Local Byte Fusion for Neural Machine Translation
Figure 2 for Local Byte Fusion for Neural Machine Translation
Figure 3 for Local Byte Fusion for Neural Machine Translation
Figure 4 for Local Byte Fusion for Neural Machine Translation
Viaarxiv icon

Learning Improvised Chatbots from Adversarial Modifications of Natural Language Feedback

Add code
Oct 15, 2020
Figure 1 for Learning Improvised Chatbots from Adversarial Modifications of Natural Language Feedback
Figure 2 for Learning Improvised Chatbots from Adversarial Modifications of Natural Language Feedback
Figure 3 for Learning Improvised Chatbots from Adversarial Modifications of Natural Language Feedback
Figure 4 for Learning Improvised Chatbots from Adversarial Modifications of Natural Language Feedback
Viaarxiv icon