Alert button
Picture for Prajjwal Bhargava

Prajjwal Bhargava

Alert button

Effective Long-Context Scaling of Foundation Models

Add code
Bookmark button
Alert button
Sep 27, 2023
Wenhan Xiong, Jingyu Liu, Igor Molybog, Hejia Zhang, Prajjwal Bhargava, Rui Hou, Louis Martin, Rashi Rungta, Karthik Abinav Sankararaman, Barlas Oguz, Madian Khabsa, Han Fang, Yashar Mehdad, Sharan Narang, Kshitiz Malik, Angela Fan, Shruti Bhosale, Sergey Edunov, Mike Lewis, Sinong Wang, Hao Ma

Figure 1 for Effective Long-Context Scaling of Foundation Models
Figure 2 for Effective Long-Context Scaling of Foundation Models
Figure 3 for Effective Long-Context Scaling of Foundation Models
Figure 4 for Effective Long-Context Scaling of Foundation Models
Viaarxiv icon

Llama 2: Open Foundation and Fine-Tuned Chat Models

Add code
Bookmark button
Alert button
Jul 19, 2023
Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez, Madian Khabsa, Isabel Kloumann, Artem Korenev, Punit Singh Koura, Marie-Anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yuning Mao, Xavier Martinet, Todor Mihaylov, Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith, Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian Xiang Kuan, Puxin Xu, Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan, Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom

Figure 1 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 2 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 3 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 4 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Viaarxiv icon

Sequence Modeling is a Robust Contender for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
May 26, 2023
Prajjwal Bhargava, Rohan Chitnis, Alborz Geramifard, Shagun Sodhani, Amy Zhang

Figure 1 for Sequence Modeling is a Robust Contender for Offline Reinforcement Learning
Figure 2 for Sequence Modeling is a Robust Contender for Offline Reinforcement Learning
Figure 3 for Sequence Modeling is a Robust Contender for Offline Reinforcement Learning
Figure 4 for Sequence Modeling is a Robust Contender for Offline Reinforcement Learning
Viaarxiv icon

AUTODIAL: Efficient Asynchronous Task-Oriented Dialogue Model

Add code
Bookmark button
Alert button
Mar 10, 2023
Prajjwal Bhargava, Pooyan Amini, Shahin Shayandeh, Chinnadhurai Sankar

Figure 1 for AUTODIAL: Efficient Asynchronous Task-Oriented Dialogue Model
Figure 2 for AUTODIAL: Efficient Asynchronous Task-Oriented Dialogue Model
Figure 3 for AUTODIAL: Efficient Asynchronous Task-Oriented Dialogue Model
Figure 4 for AUTODIAL: Efficient Asynchronous Task-Oriented Dialogue Model
Viaarxiv icon

DiscoSense: Commonsense Reasoning with Discourse Connectives

Add code
Bookmark button
Alert button
Oct 22, 2022
Prajjwal Bhargava, Vincent Ng

Figure 1 for DiscoSense: Commonsense Reasoning with Discourse Connectives
Figure 2 for DiscoSense: Commonsense Reasoning with Discourse Connectives
Figure 3 for DiscoSense: Commonsense Reasoning with Discourse Connectives
Figure 4 for DiscoSense: Commonsense Reasoning with Discourse Connectives
Viaarxiv icon

Commonsense Knowledge Reasoning and Generation with Pre-trained Language Models: A Survey

Add code
Bookmark button
Alert button
Jan 28, 2022
Prajjwal Bhargava, Vincent Ng

Figure 1 for Commonsense Knowledge Reasoning and Generation with Pre-trained Language Models: A Survey
Figure 2 for Commonsense Knowledge Reasoning and Generation with Pre-trained Language Models: A Survey
Figure 3 for Commonsense Knowledge Reasoning and Generation with Pre-trained Language Models: A Survey
Figure 4 for Commonsense Knowledge Reasoning and Generation with Pre-trained Language Models: A Survey
Viaarxiv icon

Generalization in NLI: Ways (Not) To Go Beyond Simple Heuristics

Add code
Bookmark button
Alert button
Oct 04, 2021
Prajjwal Bhargava, Aleksandr Drozd, Anna Rogers

Figure 1 for Generalization in NLI: Ways (Not) To Go Beyond Simple Heuristics
Figure 2 for Generalization in NLI: Ways (Not) To Go Beyond Simple Heuristics
Figure 3 for Generalization in NLI: Ways (Not) To Go Beyond Simple Heuristics
Figure 4 for Generalization in NLI: Ways (Not) To Go Beyond Simple Heuristics
Viaarxiv icon

Adaptive Transformers for Learning Multimodal Representations

Add code
Bookmark button
Alert button
Jun 10, 2020
Prajjwal Bhargava

Figure 1 for Adaptive Transformers for Learning Multimodal Representations
Figure 2 for Adaptive Transformers for Learning Multimodal Representations
Figure 3 for Adaptive Transformers for Learning Multimodal Representations
Figure 4 for Adaptive Transformers for Learning Multimodal Representations
Viaarxiv icon