Alert button
Picture for Rebecca Qian

Rebecca Qian

Alert button

FinanceBench: A New Benchmark for Financial Question Answering

Add code
Bookmark button
Alert button
Nov 20, 2023
Pranab Islam, Anand Kannappan, Douwe Kiela, Rebecca Qian, Nino Scherrer, Bertie Vidgen

Viaarxiv icon

SimpleSafetyTests: a Test Suite for Identifying Critical Safety Risks in Large Language Models

Add code
Bookmark button
Alert button
Nov 14, 2023
Bertie Vidgen, Hannah Rose Kirk, Rebecca Qian, Nino Scherrer, Anand Kannappan, Scott A. Hale, Paul Röttger

Viaarxiv icon

Step by Step to Fairness: Attributing Societal Bias in Task-oriented Dialogue Systems

Add code
Bookmark button
Alert button
Nov 14, 2023
Hsuan Su, Rebecca Qian, Chinnadhurai Sankar, Shahin Shayandeh, Shang-Tse Chen, Hung-yi Lee, Daniel M. Bikel

Viaarxiv icon

Perturbation Augmentation for Fairer NLP

Add code
Bookmark button
Alert button
May 25, 2022
Rebecca Qian, Candace Ross, Jude Fernandes, Eric Smith, Douwe Kiela, Adina Williams

Figure 1 for Perturbation Augmentation for Fairer NLP
Figure 2 for Perturbation Augmentation for Fairer NLP
Figure 3 for Perturbation Augmentation for Fairer NLP
Figure 4 for Perturbation Augmentation for Fairer NLP
Viaarxiv icon

Many Episode Learning in a Modular Embodied Agent via End-to-End Interaction

Add code
Bookmark button
Alert button
Apr 19, 2022
Yuxuan Sun, Ethan Carlson, Rebecca Qian, Kavya Srinet, Arthur Szlam

Figure 1 for Many Episode Learning in a Modular Embodied Agent via End-to-End Interaction
Figure 2 for Many Episode Learning in a Modular Embodied Agent via End-to-End Interaction
Figure 3 for Many Episode Learning in a Modular Embodied Agent via End-to-End Interaction
Figure 4 for Many Episode Learning in a Modular Embodied Agent via End-to-End Interaction
Viaarxiv icon

Human Evaluation of Conversations is an Open Problem: comparing the sensitivity of various methods for evaluating dialogue agents

Add code
Bookmark button
Alert button
Jan 12, 2022
Eric Michael Smith, Orion Hsu, Rebecca Qian, Stephen Roller, Y-Lan Boureau, Jason Weston

Figure 1 for Human Evaluation of Conversations is an Open Problem: comparing the sensitivity of various methods for evaluating dialogue agents
Figure 2 for Human Evaluation of Conversations is an Open Problem: comparing the sensitivity of various methods for evaluating dialogue agents
Figure 3 for Human Evaluation of Conversations is an Open Problem: comparing the sensitivity of various methods for evaluating dialogue agents
Figure 4 for Human Evaluation of Conversations is an Open Problem: comparing the sensitivity of various methods for evaluating dialogue agents
Viaarxiv icon

droidlet: modular, heterogenous, multi-modal agents

Add code
Bookmark button
Alert button
Jan 25, 2021
Anurag Pratik, Soumith Chintala, Kavya Srinet, Dhiraj Gandhi, Rebecca Qian, Yuxuan Sun, Ryan Drew, Sara Elkafrawy, Anoushka Tiwari, Tucker Hart, Mary Williamson, Abhinav Gupta, Arthur Szlam

Figure 1 for droidlet: modular, heterogenous, multi-modal agents
Figure 2 for droidlet: modular, heterogenous, multi-modal agents
Figure 3 for droidlet: modular, heterogenous, multi-modal agents
Figure 4 for droidlet: modular, heterogenous, multi-modal agents
Viaarxiv icon