Alert button
Picture for Alane Suhr

Alane Suhr

Alert button

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Add code
Bookmark button
Alert button
May 17, 2024
Yuexiang Zhai, Hao Bai, Zipeng Lin, Jiayi Pan, Shengbang Tong, Yifei Zhou, Alane Suhr, Saining Xie, Yann LeCun, Yi Ma, Sergey Levine

Viaarxiv icon

Autonomous Evaluation and Refinement of Digital Agents

Add code
Bookmark button
Alert button
Apr 10, 2024
Jiayi Pan, Yichi Zhang, Nicholas Tomlin, Yifei Zhou, Sergey Levine, Alane Suhr

Viaarxiv icon

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations

Add code
Bookmark button
Alert button
Nov 14, 2023
Wenting Zhao, Justin T Chiu, Jena D. Hwang, Faeze Brahman, Jack Hessel, Sanjiban Choudhury, Yejin Choi, Xiang Lorraine Li, Alane Suhr

Viaarxiv icon

What's In My Big Data?

Add code
Bookmark button
Alert button
Oct 31, 2023
Yanai Elazar, Akshita Bhagia, Ian Magnusson, Abhilasha Ravichander, Dustin Schwenk, Alane Suhr, Pete Walsh, Dirk Groeneveld, Luca Soldaini, Sameer Singh, Hanna Hajishirzi, Noah A. Smith, Jesse Dodge

Figure 1 for What's In My Big Data?
Figure 2 for What's In My Big Data?
Figure 3 for What's In My Big Data?
Figure 4 for What's In My Big Data?
Viaarxiv icon

Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting

Add code
Bookmark button
Alert button
Oct 17, 2023
Melanie Sclar, Yejin Choi, Yulia Tsvetkov, Alane Suhr

Viaarxiv icon

Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

Add code
Bookmark button
Alert button
Jun 02, 2023
Zeqiu Wu, Yushi Hu, Weijia Shi, Nouha Dziri, Alane Suhr, Prithviraj Ammanabrolu, Noah A. Smith, Mari Ostendorf, Hannaneh Hajishirzi

Figure 1 for Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Figure 2 for Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Figure 3 for Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Figure 4 for Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Viaarxiv icon

Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker

Add code
Bookmark button
Alert button
Jun 01, 2023
Melanie Sclar, Sachin Kumar, Peter West, Alane Suhr, Yejin Choi, Yulia Tsvetkov

Figure 1 for Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker
Figure 2 for Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker
Figure 3 for Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker
Figure 4 for Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker
Viaarxiv icon

We're Afraid Language Models Aren't Modeling Ambiguity

Add code
Bookmark button
Alert button
Apr 27, 2023
Alisa Liu, Zhaofeng Wu, Julian Michael, Alane Suhr, Peter West, Alexander Koller, Swabha Swayamdipta, Noah A. Smith, Yejin Choi

Figure 1 for We're Afraid Language Models Aren't Modeling Ambiguity
Figure 2 for We're Afraid Language Models Aren't Modeling Ambiguity
Figure 3 for We're Afraid Language Models Aren't Modeling Ambiguity
Figure 4 for We're Afraid Language Models Aren't Modeling Ambiguity
Viaarxiv icon

Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling

Add code
Bookmark button
Alert button
Jan 28, 2023
Kolby Nottingham, Prithviraj Ammanabrolu, Alane Suhr, Yejin Choi, Hannaneh Hajishirzi, Sameer Singh, Roy Fox

Figure 1 for Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling
Figure 2 for Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling
Figure 3 for Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling
Figure 4 for Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling
Viaarxiv icon

Continual Learning for Instruction Following from Realtime Feedback

Add code
Bookmark button
Alert button
Dec 19, 2022
Alane Suhr, Yoav Artzi

Figure 1 for Continual Learning for Instruction Following from Realtime Feedback
Figure 2 for Continual Learning for Instruction Following from Realtime Feedback
Figure 3 for Continual Learning for Instruction Following from Realtime Feedback
Figure 4 for Continual Learning for Instruction Following from Realtime Feedback
Viaarxiv icon