Sean Welleck

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Mar 14, 2024
Zhiqing Sun, Longhui Yu, Yikang Shen, Weiyang Liu, Yiming Yang, Sean Welleck, Chuang Gan

STEER: Unified Style Transfer with Expert Reinforcement

Nov 13, 2023
Skyler Hallinan, Faeze Brahman, Ximing Lu, Jaehun Jung, Sean Welleck, Yejin Choi

LLMSTEP: LLM proofstep suggestions in Lean

Oct 27, 2023
Sean Welleck, Rahul Saha

Llemma: An Open Language Model For Mathematics

Oct 16, 2023
Zhangir Azerbayev, Hailey Schoelkopf, Keiran Paster, Marco Dos Santos, Stephen McAleer, Albert Q. Jiang, Jia Deng, Stella Biderman, Sean Welleck

Faith and Fate: Limits of Transformers on Compositionality

Jun 01, 2023
Nouha Dziri, Ximing Lu, Melanie Sclar, Xiang Lorraine Li, Liwei Jiang, Bill Yuchen Lin, Peter West, Chandra Bhagavatula, Ronan Le Bras, Jena D. Hwang, Soumya Sanyal, Sean Welleck, Xiang Ren, Allyson Ettinger, Zaid Harchaoui, Yejin Choi

Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning

May 24, 2023
Ximing Lu, Faeze Brahman, Peter West, Jaehun Jung, Khyathi Chandu, Abhilasha Ravichander, Lianhui Qin, Prithviraj Ammanabrolu, Liwei Jiang, Sahana Ramnath, Nouha Dziri, Jillian Fisher, Bill Yuchen Lin, Skyler Hallinan, Xiang Ren, Sean Welleck, Yejin Choi

Self-Refine: Iterative Refinement with Self-Feedback

Mar 30, 2023
Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, Sean Welleck, Bodhisattwa Prasad Majumder, Shashank Gupta, Amir Yazdanbakhsh, Peter Clark

MAUVE Scores for Generative Models: Theory and Practice

Dec 30, 2022
Krishna Pillutla, Lang Liu, John Thickstun, Sean Welleck, Swabha Swayamdipta, Rowan Zellers, Sewoong Oh, Yejin Choi, Zaid Harchaoui

A Survey of Deep Learning for Mathematical Reasoning

Dec 20, 2022
Pan Lu, Liang Qiu, Wenhao Yu, Sean Welleck, Kai-Wei Chang

Generating Sequences by Learning to Self-Correct

Oct 31, 2022
Sean Welleck, Ximing Lu, Peter West, Faeze Brahman, Tianxiao Shen, Daniel Khashabi, Yejin Choi
