Picture for Maxine Eskenazi

Maxine Eskenazi

EJ

Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation

Add code
Jan 27, 2023
Figure 1 for Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation
Figure 2 for Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation
Figure 3 for Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation
Figure 4 for Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation
Viaarxiv icon

The DialPort tools

Add code
Aug 18, 2022
Figure 1 for The DialPort tools
Figure 2 for The DialPort tools
Figure 3 for The DialPort tools
Figure 4 for The DialPort tools
Viaarxiv icon

Interactive Evaluation of Dialog Track at DSTC9

Add code
Jul 28, 2022
Figure 1 for Interactive Evaluation of Dialog Track at DSTC9
Figure 2 for Interactive Evaluation of Dialog Track at DSTC9
Figure 3 for Interactive Evaluation of Dialog Track at DSTC9
Figure 4 for Interactive Evaluation of Dialog Track at DSTC9
Viaarxiv icon

LAD: Language Models as Data for Zero-Shot Dialog

Add code
Jul 28, 2022
Figure 1 for LAD: Language Models as Data for Zero-Shot Dialog
Figure 2 for LAD: Language Models as Data for Zero-Shot Dialog
Figure 3 for LAD: Language Models as Data for Zero-Shot Dialog
Figure 4 for LAD: Language Models as Data for Zero-Shot Dialog
Viaarxiv icon

DialCrowd 2.0: A Quality-Focused Dialog System Crowdsourcing Toolkit

Add code
Jul 25, 2022
Figure 1 for DialCrowd 2.0: A Quality-Focused Dialog System Crowdsourcing Toolkit
Figure 2 for DialCrowd 2.0: A Quality-Focused Dialog System Crowdsourcing Toolkit
Figure 3 for DialCrowd 2.0: A Quality-Focused Dialog System Crowdsourcing Toolkit
Figure 4 for DialCrowd 2.0: A Quality-Focused Dialog System Crowdsourcing Toolkit
Viaarxiv icon

Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning

Add code
May 25, 2022
Figure 1 for Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning
Figure 2 for Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning
Figure 3 for Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning
Figure 4 for Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning
Viaarxiv icon

Report from the NSF Future Directions Workshop on Automatic Evaluation of Dialog: Research Directions and Challenges

Add code
Mar 18, 2022
Figure 1 for Report from the NSF Future Directions Workshop on Automatic Evaluation of Dialog: Research Directions and Challenges
Figure 2 for Report from the NSF Future Directions Workshop on Automatic Evaluation of Dialog: Research Directions and Challenges
Viaarxiv icon

A Survey of NLP-Related Crowdsourcing HITs: what works and what does not

Add code
Nov 09, 2021
Figure 1 for A Survey of NLP-Related Crowdsourcing HITs: what works and what does not
Figure 2 for A Survey of NLP-Related Crowdsourcing HITs: what works and what does not
Figure 3 for A Survey of NLP-Related Crowdsourcing HITs: what works and what does not
Figure 4 for A Survey of NLP-Related Crowdsourcing HITs: what works and what does not
Viaarxiv icon

A Comprehensive Assessment of Dialog Evaluation Metrics

Add code
Jun 30, 2021
Figure 1 for A Comprehensive Assessment of Dialog Evaluation Metrics
Figure 2 for A Comprehensive Assessment of Dialog Evaluation Metrics
Figure 3 for A Comprehensive Assessment of Dialog Evaluation Metrics
Figure 4 for A Comprehensive Assessment of Dialog Evaluation Metrics
Viaarxiv icon

Schema-Guided Paradigm for Zero-Shot Dialog

Add code
Jun 13, 2021
Figure 1 for Schema-Guided Paradigm for Zero-Shot Dialog
Figure 2 for Schema-Guided Paradigm for Zero-Shot Dialog
Figure 3 for Schema-Guided Paradigm for Zero-Shot Dialog
Figure 4 for Schema-Guided Paradigm for Zero-Shot Dialog
Viaarxiv icon