OPT: Open Pre-trained Transformer Language Models


May 05, 2022
Susan Zhang, Stephen Roller, Naman Goyal, Mikel Artetxe, Moya Chen, Shuohui Chen, Christopher Dewan, Mona Diab, Xian Li, Xi Victoria Lin, Todor Mihaylov, Myle Ott, Sam Shleifer, Kurt Shuster, Daniel Simig, Punit Singh Koura, Anjali Sridhar, Tianlu Wang, Luke Zettlemoyer



Language Models that Seek for Knowledge: Modular Search & Generation for Dialogue and Prompt Completion


Mar 29, 2022
Kurt Shuster, Mojtaba Komeili, Leonard Adolphs, Stephen Roller, Arthur Szlam, Jason Weston



Human Evaluation of Conversations is an Open Problem: Comparing the Sensitivity of Various Methods for Evaluating Dialogue Agents


Jan 12, 2022
Eric Michael Smith, Orion Hsu, Rebecca Qian, Stephen Roller, Y-Lan Boureau, Jason Weston



Teaching Models new APIs: Domain-Agnostic Simulators for Task Oriented Dialogue


Oct 13, 2021
Moya Chen, Paul A. Crook, Stephen Roller



Hash Layers For Large Sparse Models


Jun 16, 2021
Stephen Roller, Sainbayar Sukhbaatar, Arthur Szlam, Jason Weston



Staircase Attention for Recurrent Processing of Sequences


Jun 08, 2021
Da Ju, Stephen Roller, Sainbayar Sukhbaatar, Jason Weston



Not All Memories are Created Equal: Learning to Forget by Expiring


May 13, 2021
Sainbayar Sukhbaatar, Da Ju, Spencer Poff, Stephen Roller, Arthur Szlam, Jason Weston, Angela Fan



Adding Chit-Chats to Enhance Task-Oriented Dialogues


Oct 24, 2020
Kai Sun, Seungwhan Moon, Paul Crook, Stephen Roller, Becka Silvert, Bing Liu, Zhiguang Wang, Honglei Liu, Eunjoon Cho, Claire Cardie



Open-Domain Conversational Agents: Current Progress, Open Problems, and Future Directions


Jul 13, 2020
Stephen Roller, Y-Lan Boureau, Jason Weston, Antoine Bordes, Emily Dinan, Angela Fan, David Gunning, Da Ju, Margaret Li, Spencer Poff, Pratik Ringshia, Kurt Shuster, Eric Michael Smith, Arthur Szlam, Jack Urbanek, Mary Williamson

