Alert button
Picture for Siddharth Verma

Siddharth Verma

Alert button

Suppressing Pink Elephants with Direct Principle Feedback

Add code
Bookmark button
Alert button
Feb 13, 2024
Louis Castricato, Nathan Lile, Suraj Anand, Hailey Schoelkopf, Siddharth Verma, Stella Biderman

Viaarxiv icon

OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models

Add code
Bookmark button
Alert button
May 19, 2023
Badr AlKhamissi, Siddharth Verma, Ping Yu, Zhijing Jin, Asli Celikyilmaz, Mona Diab

Figure 1 for OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models
Figure 2 for OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models
Figure 3 for OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models
Figure 4 for OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models
Viaarxiv icon

Uniform Masking Prevails in Vision-Language Pretraining

Add code
Bookmark button
Alert button
Dec 10, 2022
Siddharth Verma, Yuchen Lu, Rui Hou, Hanchao Yu, Nicolas Ballas, Madian Khabsa, Amjad Almahairi

Figure 1 for Uniform Masking Prevails in Vision-Language Pretraining
Figure 2 for Uniform Masking Prevails in Vision-Language Pretraining
Figure 3 for Uniform Masking Prevails in Vision-Language Pretraining
Figure 4 for Uniform Masking Prevails in Vision-Language Pretraining
Viaarxiv icon

CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 18, 2022
Siddharth Verma, Justin Fu, Mengjiao Yang, Sergey Levine

Figure 1 for CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning
Figure 2 for CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning
Figure 3 for CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning
Figure 4 for CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning
Viaarxiv icon

Continual Learning of Control Primitives: Skill Discovery via Reset-Games

Add code
Bookmark button
Alert button
Nov 10, 2020
Kelvin Xu, Siddharth Verma, Chelsea Finn, Sergey Levine

Figure 1 for Continual Learning of Control Primitives: Skill Discovery via Reset-Games
Figure 2 for Continual Learning of Control Primitives: Skill Discovery via Reset-Games
Figure 3 for Continual Learning of Control Primitives: Skill Discovery via Reset-Games
Figure 4 for Continual Learning of Control Primitives: Skill Discovery via Reset-Games
Viaarxiv icon

Fast Online "Next Best Offers" using Deep Learning

Add code
Bookmark button
Alert button
May 31, 2019
Rekha Singhal, Gautam Shroff, Mukund Kumar, Sharod Roy, Sanket Kadarkar, Rupinder virk, Siddharth Verma, Vartika Tiwari

Figure 1 for Fast Online "Next Best Offers" using Deep Learning
Figure 2 for Fast Online "Next Best Offers" using Deep Learning
Figure 3 for Fast Online "Next Best Offers" using Deep Learning
Figure 4 for Fast Online "Next Best Offers" using Deep Learning
Viaarxiv icon