Maxine Eskenazi

Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation

Jan 27, 2023
Jessica Huynh, Cathy Jiao, Prakhar Gupta, Shikib Mehri, Payal Bajaj, Vishrav Chaudhary, Maxine Eskenazi

Via arXiv

The DialPort tools

Aug 18, 2022
Jessica Huynh, Shikib Mehri, Cathy Jiao, Maxine Eskenazi

Via arXiv

Interactive Evaluation of Dialog Track at DSTC9

Jul 28, 2022
Shikib Mehri, Yulan Feng, Carla Gordon, Seyed Hossein Alavi, David Traum, Maxine Eskenazi

Via arXiv

LAD: Language Models as Data for Zero-Shot Dialog

Jul 28, 2022
Shikib Mehri, Yasemin Altun, Maxine Eskenazi

Via arXiv

DialCrowd 2.0: A Quality-Focused Dialog System Crowdsourcing Toolkit

Jul 25, 2022
Jessica Huynh, Ting-Rui Chiang, Jeffrey Bigham, Maxine Eskenazi

Via arXiv

Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning

May 25, 2022
Prakhar Gupta, Cathy Jiao, Yi-Ting Yeh, Shikib Mehri, Maxine Eskenazi, Jeffrey P. Bigham

Via arXiv

Report from the NSF Future Directions Workshop on Automatic Evaluation of Dialog: Research Directions and Challenges

Mar 18, 2022
Shikib Mehri, Jinho Choi, Luis Fernando D'Haro, Jan Deriu, Maxine Eskenazi, Milica Gasic, Kallirroi Georgila, Dilek Hakkani-Tur, Zekang Li, Verena Rieser, Samira Shaikh, David Traum, Yi-Ting Yeh, Zhou Yu, Yizhe Zhang, Chen Zhang

Via arXiv

A Survey of NLP-Related Crowdsourcing HITs: what works and what does not

Nov 09, 2021
Jessica Huynh, Jeffrey Bigham, Maxine Eskenazi

Via arXiv

A Comprehensive Assessment of Dialog Evaluation Metrics

Jun 30, 2021
Yi-Ting Yeh, Maxine Eskenazi, Shikib Mehri

Via arXiv

Schema-Guided Paradigm for Zero-Shot Dialog

Jun 13, 2021
Shikib Mehri, Maxine Eskenazi

Via arXiv