Moya Chen

Llama 2: Open Foundation and Fine-Tuned Chat Models

Jul 19, 2023
Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez, Madian Khabsa, Isabel Kloumann, Artem Korenev, Punit Singh Koura, Marie-Anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yuning Mao, Xavier Martinet, Todor Mihaylov, Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith, Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian Xiang Kuan, Puxin Xu, Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan, Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom

Via arXiv

The HCI Aspects of Public Deployment of Research Chatbots: A User Study, Design Recommendations, and Open Challenges

Jun 07, 2023
Morteza Behrooz, William Ngan, Joshua Lane, Giuliano Morse, Benjamin Babcock, Kurt Shuster, Mojtaba Komeili, Moya Chen, Melanie Kambadur, Y-Lan Boureau, Jason Weston

Via arXiv

A Theory on Adam Instability in Large-Scale Machine Learning

Apr 25, 2023
Igor Molybog, Peter Albert, Moya Chen, Zachary DeVito, David Esiobu, Naman Goyal, Punit Singh Koura, Sharan Narang, Andrew Poulton, Ruan Silva, Binh Tang, Diana Liskovich, Puxin Xu, Yuchen Zhang, Melanie Kambadur, Stephen Roller, Susan Zhang

Via arXiv

ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning

Dec 15, 2022
Olga Golovneva, Moya Chen, Spencer Poff, Martin Corredor, Luke Zettlemoyer, Maryam Fazel-Zarandi, Asli Celikyilmaz

Via arXiv

BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage

Aug 10, 2022
Kurt Shuster, Jing Xu, Mojtaba Komeili, Da Ju, Eric Michael Smith, Stephen Roller, Megan Ung, Moya Chen, Kushal Arora, Joshua Lane, Morteza Behrooz, William Ngan, Spencer Poff, Naman Goyal, Arthur Szlam, Y-Lan Boureau, Melanie Kambadur, Jason Weston

Via arXiv

OPT: Open Pre-trained Transformer Language Models

May 05, 2022
Susan Zhang, Stephen Roller, Naman Goyal, Mikel Artetxe, Moya Chen, Shuohui Chen, Christopher Dewan, Mona Diab, Xian Li, Xi Victoria Lin, Todor Mihaylov, Myle Ott, Sam Shleifer, Kurt Shuster, Daniel Simig, Punit Singh Koura, Anjali Sridhar, Tianlu Wang, Luke Zettlemoyer

Via arXiv

Teaching Models new APIs: Domain-Agnostic Simulators for Task Oriented Dialogue

Oct 13, 2021
Moya Chen, Paul A. Crook, Stephen Roller

Via arXiv

Retrieval Augmentation Reduces Hallucination in Conversation

Apr 15, 2021
Kurt Shuster, Spencer Poff, Moya Chen, Douwe Kiela, Jason Weston

Via arXiv